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(57) Abstract 

The present invention concerns proteins encoded by a family of genes, tenned licre //Ox-related genes, which are involved in the 
control of chromatin structure and, thus in transcription and translation. The present invention makes available compositions and methods 
that can be utilized, for example, to control cell proliferation and differentiation in vitro and in vivo. 
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Government Funding 
Work described herein was supported in part by funding from the National 
Institute of Health. The United States Government has certain rights in the invention. 

Background of Ike Invendon 
The organization of regulatory DNA elements into precise chromatin structures is 
unportant for both DNA repUcation and transcription in vivo (Lee et al (1993) CeU 
72:73^; Felsenfeld (1992) Nature. 355:219). In eukatyotic cells, nuclear DNA exists as 
a hierarchy of chromatin structures, resulting in the compaction of nuclear DNA about 
10,000 fold (Davie and Hendzel (1994) J. CeU. Biochem. 55:98). The repeating 
structural unit b the extended 10 nm fibre fonn of chromatin is the nucleosome (van 
Holde(1988)£luomatiQ. New York: Springer-Verlag). The nucleosome consists of 146 
bp of DNA wrapped around a protein core of the histones H2A, H2B. H3, and H4, 
known as the core histones. These histones are arranged as an (H3.H4)2 tet^amer and 
two H2A-H2B dimers positioned on each face of the tetramer. The DNA joining the 
nucleosomes is caUed linker DNA; it is to the linker DNA to which the HI or linker 
histones bind. The 10 mn fibre is compacted fiirther into the 30 nm fibre. Linker 
histones and amino-terminal regions ("taDs") of the core histones maintain the higher 
order folding of chromatin (Garcia Ramirez et al.. (1992) J. Biol Chem 267:19587). This 
chromatin structure must be relaxed when DNA is transcribed or translated. 

IBstones of the nucleosome core particle are subject to reversible acetylation at 
tiie e-amino group of lysines present in their amino terminus (Csordas et al (1990) 
Biochem J 265:23-38). TranscripuonaUy silent regions of tiie genome are cmiched in 
undeiacetylated histone H4 (Turner (1993) Cell 75:5-8). and histone hyperacetylation 
facilitates the ability of transcription fector TFIHA to bind to chromatin templates (Lee et 
al. (1993) Cell lim-U). Recem genetic, biochemical and immunological approaches 
have provided substantial evidence indicating tiiat histones associated wiUi actively 
transcribed genes are more highly acetylated than those from nontranscribed regions 
While not wishin^to be bound bv anv camcUr theonr, histone acetylation may influence 
transcnpuon at several stages, for example, by causing transcription factors to bind or by 



wo 97/35990 



PCT/US97/05275 



-2- 

inducing structural transitions in chromatin, or by facilitating histone displacement and 
repositioning during polymerase elongation. 

The acetylation and deacetylation are catalyzed by specific enzymes, histone 
acetyltransferase and deacetylase, respectively, and the net level of the acetylation is 
5 controlled by the equilibrium between these enzymes. The steady state level of 
acetylation and the rates at which acetate groups are turned over vary both between and 
within different cell types, with half-lives that vary from a few minutes to several hours. 
Although a histone acetyltransferase gene (HATl) has been id^tified in yeast (Kelflf et 
al. (1995) y. Biol Chem. 270:24674-24677), the molecular entities responsible for 
10 histone deacetylation were heretofore unknown in the art. 

The identification of the mechanism by which histones are deacetylated would be 
of great benefit in the control of gene transcription and the cell cycle. 

Summary of the Invention 

15 The present invention relates to the discovery of a novel family of genes, and gene 

products, expressed in mammals, which genes are referred to hereinafter as the "histone 
deacetylase" genes or *'/«>r" gene family, the products of which are referred to as 
histone deacetylases or HDx proteins. 

In general, the invention features isolated HDx polypeptides, preferably 
20 substantially pure preparations of one or more of the subject HDx polypeptides. The 
invention also provides recombinantly produced HDx polypeptides. In preferred 
embodiments the polypeptide has a biological activity including an ability to deacetylate 
an acetylated histone substrate, preferably a substrate analog of histone H3 and/or H4. In 
other embodiments the HDx polypeptides of the present invention bind to trapoxin or to 
trichostatin, such binding resulting in the inhibition a deacetylase activity of the HDx 
polypeptide. However, HDx polypeptides which specifically antagonize such activities, 
such as may be provided by dominant negative mutants, are also specifically 
contemplated. 

The HDx polypeptides disclosed herein are capable of modulating proliferation, 
survival and/or differentiation of cells, because of their ability to alter diromatin structure 
by deacetylating histones such as H3 or H4. Moreover, in preferred embodiments, the 
subject HDx proteins have the ability to modulate cell growth by influencing cell cycle 
progression or to modulate gene transcription. 
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In one embodiment, the polypeptide is identical with or homologous to an HDx 
protein. Exemplary HDx polypeptide include amino acid sequences represented in any 
one of SEQ ID Nos 5-8. Related members of the HDx family are also contemplated, for 
mstance, an HDx polypeptide preferably has an amino acid sequence at least 85% 
5 homologous to a polypeptide represented by one or more of Uie polypeptides designated 
SEQ ED Nos: 5-8, though polypeptides with higher sequence homologies of, for example, 
88, 90% and 95% or are also contemplated. In one embodiment, the HDx polypeptide is 
encoded by a nucleic acid which hybridizes under stringent conditions wiUi a nucleic acid 
sequence represented in one or more of SEQ ID Nos. 1-4. Homologs of the subject iKQr 
10 proteins also include versions of the protein which are resistant to post-translation 
modification, as for example, due to mutations which alter modification sites (such as 
tyrosine, tiireonine, serine or aspargine residues), or which inactivate an enzymatic 
activity associated with the protein. 

The HDx polypeptide can comprise a fiiU lengtii protein, such as represented in 
SEQ ID No. 5, or it can comprise a fluent corresponding to particular motifs/domains, 
or to arbitrary sizes, e.g., at least 5, 10, 25. 50, 100, 150 or 200 amino acids in length. Iil 
preferred embodiments, the polypeptide, or fi-agment tiiereof, specifically deacetyiates 
histone H4. In other preferred embodiments, the HDx polypeptide includes both a v 
motif (SEQ ID No. 12) and a x motif (SEQ ID No. 14), preferably a v motif represented 
in tiie general formula SEQ ID No. 13, and a x motif represented in the general fonnula 
SEQ ID No. 15. 

In certain preferred embodiments, the invention features a purified or recombinant 
HDx polypeptide having a molecular weight in tiie range of 40kd to 60kd. For instance, 
preferred HDx polypeptides, have molecular weights in the range of 50kd to about 60kd, 
even more preferably in tiie range of 53-58kd. It will be understood that certain post- 
translational modifications, e.g., phosphorylation, prenylation and the like, can increase 
ti>e apparent molecular weight of tiie HDx protein relative to the unmodified polypeptide 
chain. 

The subject proteins can also be provided as chimeric molecules, such as in the 
form of fiision proteins. For instance, tiie HDx protein can be provided as a recombinant 
fiision protein which includes a second polypeptide portion, e.g., a second polypeptide 
having an amino add sequence unrelated (heterologous) to the HDx pol>peptide, e.g. tiie 
second polypeptide portion is glutathione-S-transferase. e.g. tiie second polypeptide 
portion is an enzymatic activity such as alkaline phosphatase, e.g. the second polypeptide 
35 portion is an epitope tag. 
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In yet another embodiment, the invention features a nucleic acid encoding a an 
HDx polypeptide, or polypeptide homologous thereto, which polypeptide has the ability 
to modulate, e.g., either mimic or antagonize, at least a portion of the activity of a wild- 
type HDx polypeptide. Exemplary //Z>x-encoding nucleic acid sequences are represented 
5 bySEQIDNos: 1-4. 

In another embodiment, the nucleic acid of the present invention includes a 
coding sequence which hybridizes under stringent conditions with one or more of the - 
nucleic acid sequences in SEQ ID Nos: 1-4. The coding sequence of the nudeic acid can 
comprise a sequence which is identical to a coding sequence represented in one of SEQ 
10 ID Nos: 1-4, or it can merely be homologous to one or more of those sequences. In 
preferred embodiments, the nucleic acid encodes a polypeptide >^ch specifically 
modulates, by acting as either an agonist or antagonist, the enzymatic activity of an HDx 
polypeptide. 

Furthermore, in certain preferred embodiments, the subject HDx nucleic acid will 
IS include a transcriptional regulatory sequence, e.g. at least one of a transcriptional 
promoter or transcriptional enhancer sequence, which regulatory sequence is operably 
linked to the HDx gene sequence. Such regulatory sequences can be used in to render 
the HDx gene sequence suitable for use as an expression vector. This invention also 
contemplates the cells transfected with said expression vector whether prokaiyotic or 
20 eukaryotic and a method for producing HDx protdns by employing said expression 
vectors. 

In yet another embodiment, the nucleic acid hybridizes under stringent conditions 
to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of either 
sense or antisense sequence of one or more of SEQ ID Nos: 1-4; though preferably to at 
25 least 25 consecutive nucleotides; and more preferably to at least 40, 50 or 75 consecutive 
nucleotides of either sense or antisense sequence of one or more of SEQ ID Nos; 1-4. 

Yet another aspect of the present invention concerns an immimogen comprising 
an HDx polypeptide in an immunogenic preparation, the immunogen being capable of 
elidting an immune response specific for an HDx polypeptide; e.g. a humoral response, 
30 e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the 
immunogen comprising an antigenic determinant, e.g. a unique determinant, fi-om a 
protein represented by one of SEQ ID Nos. 5-8. 

A still fiuther aspect of the present invention features antibodies and antibody 
preparations specifically reactive with an epitope of the HDx immunogen. 
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The invention also features transgenic non-human animals, e.g. mice, rats, rabbits, 
chickens, frogs or pigs, having a transgene, e.g.. animals which include (and preferably 
express) a heterologous fom of an HDx gene described herein, or which misexpress an 
endogenous HDx gene. e.g.. an animal in which expression of one or more of the subject 
5 HDx proteins is disrupted. Such a transgenic animal can serve as an animal model for 
studymg ceUuIar and tissue disorders comprising mutated or mis-expressed HDx alleles 
or for use in drug screening. 

The invention also provides a probe/primer comprising a substantially purified 
oLgomicleotide. wherein the oligonucleotide comprises a region of nucleotide sequence 
10 which hybridizes under stringent conditions to at least 12 consecutive nucleotides of 
sense or antisense sequence of SEQ ID Nos: M. or naturally occurring mutants thereof 
Nucleic acid probes which are specific for each of the HDx proteins are contemplated by 
the present mvention. e.g. probes which can discern between nucleic acid encoding a 
human or bovine HD In preferred embodiments, the probe/primer further includes a 
15 label group attached thereto and able to be detected. The labd group can be selected 
e.g., from a group consisting of radioisotopes, fluorescent compounds, enzymes and 
enzyme co-factors. Probes of the invention can be used as a part of a diagnostic te'st kit 
for Identifying dysfimctions associated with mis-expression of ^ HDx protein, such as 
for detecting in a sample of ceUs isolated from a patient, a level of a mideic add encoding 
20 a subject HDx protein; e.g. measuring an HDx mRNA levd in a cell, or determining 
whether a genomic HDx gene has been mutated or deleted. TTiese so called 
"probes/primers" of the invention can also be used as a part of "antisense" therapy which 
refers to administration or in situ generation of oUgonucleotide probes or tiieir 
denvatives which specifically hybridize (e.g. bind) under cellular conditions, with tiie 
25 ceUular mRNA and/or genomic DNA encoding one or more of the subject HDx proteins 
so as to mhibit expression of tiiat protein, e.g. by inhibiting transcription and/or 
translation. Preferably, tiie oligonudeotide is at least 12 nucleotides in length, though 
pnmers of 25, 40, 50, or 75 nudeotides in length are also contemplated. 

In yet anotfier aspect, tiie invention provides an assay for screening test 
30 compounds for inhibitors, or altemativdy, potentiators, of an interaction between an 
HDx proton and an HDx binding protdn or nuddc acid sequence. An exemplary 
method includes tiie steps of 0) combining an HDx polypeptide or fragment thereof, one 
or more HDx target polypeptide (such as a histone. SINS. RpAp48 or other protein 
which participates in HDx complexes. e.g.. one or more protdns having molecular 
35 weights of 250 kDa, 180 kDa. 55 kDa, 50 kDa. 42 kDa, 33-36 kDa and 30 kDa, see also 
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Example 3), and a test compound, e.g., under conditions wherein, but for the test 
compound, the HDx protein and target polypeptide(s) are able to interact; and (ii) 
detecting the formation of a complex which includes the HDx protein and target 
polypeptide(s) either by du-ectly quantitating the complex, the deacetylase activity of the 
5 HDx protein, or by measuiing inductive eflfeas of the HDx protein. A statistically 
significant change, such as a decrease, in the formation of the complex in the presence of 
a test compound (relative to what is seen in the absence of the^test. compound) is 
indicative of a modulation, e.g., inhibition, of the interaction between the HDx protein 
and its target polypeptide. 

10 Furthermore, the present invention contemplates the use of other homologs of the 

HDx polypeptides or bioactive firagments thereof to generate similar assay formats. In 
one embodiment, the drug screening assay can be derived with a fungal homolog of an 
HDx protein, such as RPD3, in order to identify agents which inhibit histone 
deacetylation in a yeast cell. 

IS Yet another aspect of the present invention concerns a method for modulating 

one or more of growth, differentiation, or survival of a mammalian cell by modulating 
HDx bioactivity, e.g., by inhibiting the deacetylase activity of HDx proteins, or disrupting 
certain protein-protein interactions. In general, whether carried out in vivo, in vitro, or 
in situ, the method comprises treating the cell with an effective amount of an HDx 

20 therapeutic so as to alter, relative to the cell in the absence of treatment, at least one of 
Q) rate of growth, (ii) differentiation, or (iii) survival of the cell. Accordingly, the 
method can be carried out with HDx therapeutics such as peptide and peptidomimetics or 
other molecules identified in the above-referenced drug screens which antagonize the 
effects of a naturally-occurring HDx protein on said cell. Other HDx therapeutics include 

2S antisense constructs for inhibiting expression of HDx proteins, and dominant negative 
mutants of HDx proteins which competitively inhibit protein-substrate and/or protein- 
protein interactions upstream and downstream of the wild-type HDx protein. 

In an exemplary embodiment the subject method is used to treat tumor cells by 
antagonizing HDx activity and blocking cell cycle progression. In one embodiment, the 

30 subject method includes the treatment of testicular cells, so as modulate spermatogenesis. 
In another embodiment, the subject method is used to modulate osteogenesis, comprising 
the treatment of osteogenic cells with an HDx polypeptide. Likewise, where the treated 
ceil is a chondrogenic cell, the present method is used to modulate chondrogenesis. In 
still another embodiment, HDx polypeptides can be used to modulate the differentiation 

35 of progenitor cells, e.g., the method can be used to cause differentiation of a 
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hematopoieuc cells, neuronal ceUs, or other stenj/progenitor cell populations, to maintain 
a that cell in a differentiated state, and/or to enhance the survival of a diflferentiated cell, 
e.g., to prevent apoptosis or other forms of cell death. 

In addition to such HDx therapeutic uses, anti-fungal agents developed with such 
5 screening assays as described herein can be used, for example, as preservatives in 
foodstuff, feed supplement for promoting weight gain in livestock, or in disinfectant 
formulations for treatment of non-living matter, e.g., for decontaminating hospital 
equipment and rooms. In similar feshion, assays provided herein will permit selection of 
deacetylase inhibitors which discriminate between the human and insect deacetylase 

10 enzymes. Accordingly, the present invention expressly contemplates the use and 
formulations of the deacetylase inhibitors in insecticides, such as for use in management 
of insects like the fiuit fly. Moreover, certain of the inhibitors can be selected on the 
basis of inhibitory specificity for plant fflJr-rdated activities relative to the mammalian 
enzymes. Thus, the present invention specifically contemplates fonnulations of 

15 deacetylase inhibitors for agricultural applications, such as in the foim of a defoUant or 
the like. 

The present method is applicable, for example, to cell culture technique, such as 
in the culturing of hematopoietic ceUs and other cells whose survival or differentiative 
state is dependent on HDx function. Moreover, HDx agonists and antagonists can be 
20 used for therapeutic intervention, such as to enhance survival and maintenance of ceUs, as 
well as to influence organogenic pathways, such as tissue patterning and other 
differentiation processes. In an exemplary embodiment, the method is practiced for 
modulating, in an animal, ceU growth, cell differentiation or cell survival, and comprises 
administering a therapeutically effective amount o£ an HDx polypeptide to alter, relative 
the absence of HDx treatment, at least one of (0 rate of growth, fii) differentiation, or 
Oii) survival of one or more cell-types in the animal. 

Another aspect of the present invention provides a method of determining if a 
subject, e.g. a human patient, is at risk for a disorder characterized by unwanted ceU 
proliferation or aberrant control of differentiation. The method includes detecting, in a 
tissue of the subject, the presence or absence of a genetic lesion characterized by at least 
one of CO a mutation of a gene encoding an HDx protein, e.g. represented in one of SEQ 
ID Nos: 1-4, or a homolog thereof; or C") the mis-expression of an HDx gene. In 
preferred embodiments, detecting the genetic lesion includes ascertaining tiie existence of 
at least one of; a deletion of one or more nucleotides fi-om an HDx gene; an addition of 
one or more nucleotides to the gene, a substitution of one or more nucleotides of the 
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gene, a gross chromosomal rearrangement of the gene; an alteration in the level of a 
messenger RNA transcript of the gene; the presence of a non-wild type splicing pattern of 
a messenger RNA transcript of the gene; or a non-wild type level of the protein. 

For example, detecting the genetic lesion can include (i) providing a probe/primer 
5 including an oligonucleotide containing a region of nucleotide sequence which hybridizes 
to a sense or antisense sequence of an /©x gene, e.g. a nucleic acid represented in one of 
SEQ ID Nos: 1-4, or naturally occurring mutants thereof, or 5* or 3' flanking sequences 
naturally associated with the HDx gene; (ii) exposing the probe/primer to nucleic acid of 
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the 

10 presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises 
utilizing the probe/primer to determuie the nucleotide sequence of the HDx gene and, 
optionally, of the flanking nucleic acid sequences. For instance, the probe^rimer can be 
employed in a polymerase chain reaction (PGR) or in a ligation chain reaction (LCR). In 
alternate embodiments, the level of an HDx protein is detected in an inununoassay using 

1 5 an antibody which is specifically immunoreactive with the HDx protein. 

In another aspect, the invention provides compounds usefiil for inhibition of 
HDxs. In a preferred embodiment, an HDx inhibitor compound of the invention can be 
represented by the formula A-B-C, in which A is a specificity element for selective 
binding to an HDx, B is a Imker element, and C is an electrophilic moiety capable of 
20 reacting with a nucleophilic moiety of an HDx\ with the proviso that the compound is not 
butyrate, trapoxin, or trichostatin. 

For instance, in one embodiment, there is provided a composition for inhibiting a 
histone deacetylase comprising a compound represented by the general formula A-B-C. 
wherein 

25 A is selected firom the group consisting of cycloallQrls, unsubstituted and 

substituted aryls, heterocyclyls, amino acyls, and cyclopeotides; 

B is selected firom the group consisting of substituted and unsubstituted C4-Cg 
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in vAnch D and F 
are, independently, absent or represent a C2-C7 alkylidene, a Ci-Oj alkenjdidene or a C2- 
30 C7 alkynylidene, and E represents 0, S, or NR', in which R* represents H, a lower alkyl, a 
lower alkenyl, a lower alkynyl, an aralkyl, aryl, or a heterocyclyl; and 

C is selected firom the group consisting of 
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Y O 




H 

R7 ^N. ^R7 




, and a boronic acid; in 



which Z represents O, S. or NR5, and Y; R5 represents a hydrogen, an alJcyl, an 
alkoxycarbonyl, an aiyioxycarbonyl, an alkykulfonyl. an aiylsulfonyl or an aiyl; R'g 
represents hydrogen, an alkyl. an alkenyl, an alkynyl or an aiyl; and R7 represeiits a 
5 hydrogen, an alkyl, an aiyl, an alkoxy. an aiyloxy, an kmino, a hydroxylamino. an 
alkoxylamino or a halogen; with the proviso that the compound is not trapoxin. 

In another preferred embodiment, the compound represented by the general 
formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and 
0 substituted aryis, heterocyclyls, amino acyls, and cydopeptides; 

B is selected from the group consisting of substituted and unsubstituted C4-C8 
alkylidenes, C4-C8 alkenyUdencs, C4-Cg alkynylidenes, and -{D-E-F>, in which D and F 
are, independently, absent or represent C2-C7 alkylidenes, C2-C7 alkenylidenes or C2-C7 
alkynyUdenes, and E represents O, S, or MR', in which R' represents H, a lower alkyl, a 
5 lower alkenyl, a lower alkynyl, an aralkyl, an aiyl, or a heterocydyl; and 

C is selected from the group consisting of 



an aiyl, a hydroxyl, an alkoxy, an ao^oxy or an amino, 

with the proviso that the inhibitor compound is not trichostatin. 

In stiU another preferred embodiment, the compound is represented by the general 
formula A-B-C, wherein 

A is sdected from the group consisting of cydoalkyls, unsubstituted and 
substiuited aiyls, heterocyclyls, amino acyls, and cydopeptides; 

B is sdected from the group consisting of substituted and unsubstituted C4-C8 
alkylidenes, C4-C8 alkenylidenes. C4-C8 alkynyUdenes, and .(D-E-F>, in which D and F 
are, independently, absent or a C2-C7 alkylidene, a C2-C7 alkenylidene, or a C2-C7 
aikynylidene, and E represents O, S, or NR', in which R' is H, lower alkyl, lower alkenyl, 
lower alkynyl, aralkyl, aiyl, or heterocydyl; and 




, in which R9 represents a hydrogen, an alkyl, 
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C represents ; in which Y is O or S, and R7 represents a hydrogen, an 

alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxyiamino, an alkoxylamino or a 
halogen. 

The present invention also contemplates pharmaceutical preparations of such 
5 compounds, e.g., in an amount effective for inhibiting proliferation of a cell, formulated 
in a pharmaceutically acceptable diluent. 

Moreover, such compounds can be used for modulating one or more of growth, 
differentiation, or survival of a mammalian cell responsive to /^Z>r-mediated histone 
deacetylation, by treating the cell with an effective amount of the deacetylase inhibitor so 
10 as to modulate the deacetylase activity and alter, relative to the cell in the absence of the 
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the rate of 
survival of the cell 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, transgenic 

15 biology, microbiology, recombinant DNA, and immunology, which are within the skill of 
the art. Such techniques are explained fiilly in the literature. See, for example, Molecular 
Cloning A Laboratory Manual, 2nd Ed., ed, by Sambrook, Fritsch and Maniatis (Cold 
Spring Harbor Laboratory Press; 1989); DNA Cloning, Volumes I and n (D. N. Glover 
ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent 

20 No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Kggins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of 
Animal Cells (R. I Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes 
(JRL Press, 1986); B. Peibal, A Practical Guide To Molecular Cloning (1984); the 
treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors 

25 For Mammalian Cells (L H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor 
Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.), 
Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds.. 
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I- 
IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, 

30 (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1986). 
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Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 



10 
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Brief Description of the Drawings 

Figure lA illustrates the chemical structures of trapoxin and trichostatin, natural 
products that inhibit the CMTmatic deawtxlation of lysine residues near the NHa-terminus 
of histones. The epoxyketone side chain of trapoxin is approximately isosteric with N- 
acetyl lysine and likely alkylates an active site nucleophile. 

Figure IB illustrates the copurification of trapoxin binding and histone 
deacetylase activities. Nuclear proteins from bovine thymus were precipitated with 
ammonium sulfate and fractionated on a Mono Q column. Trapoxin binding was 
assayed by charcoal precipitation with [3H]trapoxin. For the histone deacetylase assay, 
a peptide corresponding to bovine histone H4 (1-24) was synthesized. The peptide wa^ 
chemically acetylated with sodium piflacetate (5.3 Ci/mmol. New England 
Nuclear)/BOP reagent (Aldrich) and purified by reverse phase HPLC. Two microUters of 
[3H]peptide(~40,000dpm) were used per 200 ^1 assay. After incubation at 370C for one 
hour, the reartion was quenched with 1 M HCl/0.16 M acetic acid (50 nl). Released 
pHjacetic acid was extracted with 600 \i\ of ethyl acetate and quantified by scintiUatton 
counting. Pretreatment of erode or partially purified enzyme with trapoxin or trichostatin 
(20nM) for 30 min. at 40C abolished deacetylase activity. A28o= absorbance at 280 nm. 

Figure 2A shows the synthesis of K-trap and the K-trap affinity matrix. K-trap 
contains a protected lysine residue in place of the phenylalanine at position two m 
trapoxin. Alloc = allyloxycarbonyl. 

Figure 2B is a silver stained gel showing bovine and human trapoxin binding 
proteins. Proteins bound to the K-trap affinity matrix in the presence or absence of 
trapoxin or trichostatin were eluted by boiling in SDS loading buflfer and analj-zed by 
SDS-PAGE (9% gel). Nuclear proteins from human Jurkat T cells were prepared 
identically to those from bovine thymus (Figure IB). Molecular size standards (b 
kilodaltons) are indicated to the right. 

Figure 3A is the predicted amino acid sequence of human HDl. An in-frame stop 
codon was found upstream of the starting methionine. Regions equivalent to 
microsequenced tiyptic peptides from the purified bovine protein are boxed. Underlined 
amino acids 319-334 and 467-482 denote the sequences of synthetic peptides that were 
conjugated to KLH and used to generate polyclonal antisera. Abbreviations for the 
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amino acid residues are: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ue; K, 
Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and 
Y,Tyr. 

Figure 3B is a protein immunoblot analogous to the silver stained gel in Figure 
5 2B, showing the relationship between bovine p46-p49 and human p55 (top panels) and 
confirmmg the identity of p50 (bovine and human) as RbAp48 (bottom panels). Proteins 
eluted from the K-trap affinity matrix (Figure 2) were separated by SDS-PAGE and 
transferred to Immobilon-P (Millipore). Blots were probed with polyclonal anti-//Z)l 
(319-336) or monoclonal anti-RbAp48 and bound antibodies were detected with 
10 enhanced chemiluminescence (Amersham). 

Figure 4 A is an immunoprecipitation of endogenous histone deacetylase activity 
with affinity purified anti-i/Z) 1(467-482) antibodies. Anti-/«)1(467.482) 
inmiunoprecipitates from equivalent amounts of Jurkat nuclear extract (1 mg nuclear 
protein supplemented with 0.5 M NaCl, 1% BSA, and 0.1% NP-40) were isolated in the 
15 presence or absence of /H) 1(467-482) peptide competitor. After resuspending the 
immunoprecipitates in HDx bufifer [20 mM tris (pH 8), 150 mM NaCl, 10% glycerol], 
inhibitors were added as indicated, and histone deacetylase activity was measured as 
described in Figure 1 A. 

Figure 4B shows the coprecipitation oiHDI and RbAp48, as detected by protein 
20 immunoblot analysis. 

Figure 4C demonstrates the histone deacetylase activity of recombinant HDl~F, 
Tag Jurkat cells (Clipstone et al. (1992) Nature 357, 695-7) were transfected with pFJ5 
(vector alone) or pBJ5////)l-F (encoding COOH-terminal FLAG epitope tagged HDJ) 
by electroporation and detergent lysates were prepared [0.5% Triton X-100, 50 mM tris 
25 (pH 8), 100 mM NaCl, 10% glycerol]. Anti-FLAG antibodies conjugated to agarose 
beads (TBI) were used to immunoprecipitate recombinant HDJ in the presence or absence 
of FLAG peptide competitor, and histone deacetylase activity was measured as described 
above. 

Figure 4D shows the interaction between recombinant HDl-F and the K-trap 
30 affinity matrix. Lysates from Jurkat cells transfected with pBJ5/HDl-F were incubated 
with the K-trap affinity matrix in the presence or absence of inhibitors. Immunoblots of 
the eluted proteins were probed with the anti-FLAG M2 monoclonal antibody (BI). 

Figures 5A and 5B are sequence alignments for various HDx and //Dx-related 
cDNAs and proteins, respectively. 
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Figure 6 depicts exemplary specificity elements (A), linker elements (B), and 
electrophilic moieties (C) for generating compounds which are capable of reacting with a 
nucleophilic moiety of an HDx protein. 

Figure 7 illustrates an exemplary synthesis of trichostatin analogs. 
5 Figures 8A-8C illustrate a syndesis of tritiatedTrapoxinB. 

Figures 9A-9C depict a sy^esis of the K-trap and K-trap aflSnity matrix . 
Figures lOA-lOB: mSinSA is present in cells as a large stable multiprotein 
complex. Nuclear lysates were prepared fi-om U937 cells metabolicaUy labeled with 
[35]S-methionine and low stringency immunoprecipitations performed with antiserum 
10 specific for mSin3A. "+ block" shows proteins immunopredpitated when the anti- 
mSin3 A was preincubated with purified GST-PAH2 (A). In (B), low stringency mSin3A 
immunoprecipitates were washed for an additional 60 minutes using the salt and 
detergent conditions indicated at the top of the Figure. In (A) and (B), the 
immunoprecipitates were analyzed by SDS-PAGE and autoradiography. Apparent 
15 molecular weight of the copredpitating proteins and the sizes of the molecular weight 
markers are given in kilodaltons. 

Figures llA-D: mSinSA and EMACl associate in vivo. Immunopredpitations 
were performed using nuclear extracts from pSjS-methionine labeled U937 ceUs. (A) The 
left lane shows proteins from an anti-mSin3A immunoprecipitate. The right lane shows 

20 proteins duted from an anti-mSin3A immunoprecipitate and reprecipitated with anti- 
serum specific for HDl. In (B) and (C), low stringency immunoprecipitations were 
performed using antiserum specific for the carboxy-terminus of HD 1 . "+ block" indicates 
that the HDl antiserum was preincubated with the inununizing peptide. In (C), proteins 
immunoprecipitated with anti-mS3A are shown for inference, proteiijs duted from a low 

25 stringency anti-HDl immunopredpitate and repredpitated with anti-mSin3A are shown 
in the right most lane. In (A), (B), and (C), autoradiographs of SDS-PAGE gds are 
shown. Apparent molecuUr weight of the copredpitating, protdns and the sizes of the 
molecular wdght maricers are given in kilodaltons. In (D), in vitro histone deacetylase 
acUvity in anti-mSin3A immunoprecipiutes is shown. Human Jurkat cell extracts (12 

30 mg) were immunopredpitated using anti-mSin3A polydonal antibodies, "+block" 
indicates that the anti-mSin3A antibody was prdncubated with GST-PAH2, "+10 nM 
trapoxin" indicates that the immunopredpitated proteins were pretreated with 10 nM 
trapoxin for 30 minutes at 4"C prior to bdng assayed for histone deacetylase activity. 
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Figures 12A-C: RhAp48 is associated with mSinSA in vivo and recombinant 
HDJ, RbAp48 and mSin3A copurify from insect cell extracts. (A) TAg Jurkat cell 
lysates were immunoprectpitated using antibodies specific to C-terminalal portion of HDl 
(left) or antibodies specific to PAH2 of mSinSA (right). Parallel immunoprecipitations 
5 were blocked as described in figure 11. Immunopurified proteins were analyzed by SDS- 
PAGE and immunoblotted with (x-RbAp48 monoclonal antibody 12B1. (B & C) Equal 
amounts of Jbaculovirus coinfectcd Sf9 cell extracts were aflSnity purified using Ni2+- 
agarose "Ni" or or x-FLAG-M2-agarose "F". Purified recombinant proteins were 
analyzed by SDS-PAGE, transferred to Imraobilon-P (Millipore) and immunoblotted with 
10 FLAG to detect HDl-F (B) or (x-Flu (12CA5) to detect p48-HA (C), We observe a 
reduction in expression of HDI-F and p48-HA when coexpressed with mSin3 A 

Figures 13A-C: Trapoxin reverses transcriptional repression by mSinSA. (A) 
The structure of the minimal reporter gene derived fi-om the myelomonocytic growth 
factor gene and the expresMon vectors. Mad(Pro)N35GALVPI6 has leucine at position 

15 12 and alanine at position 16 mutated to proline as indicated. These point mutations 
prevent association between mSin3A and Mad (Ayer et al., 1995). The transcriptional 
activity of MadN35GALVP16 and Mad(Pro)N35GALVP16 was determined by 
measuring luciferasc activity (Relative Light Units, RLU) of transfected 293 cells 
following an 8 hour treatment with 0 (solid bars) or 10 nM trapoxin (striped bars) (B), 

20 To control for differences m transfection efficiency, the RLU values were normalized to 
the p-galactosidase activity produced by a cotransfected CMV-pGAL construct. Shown 
is data fi-om representative experiment and the error is reported as the standard error of 
the mean (s.e.m). This experiment has been done a minimum of five times in triplicate 
with similar results. An 8 hour treatment of 293 cells with 10 nM trapoxin is within the 

25 linear range of the response of the reporter gene. Furthermore, trapoxin treatment did 
not prevent association between mSin3A and HDl (data not shown). (C), trapoxin 
inhibits histone deacetylase activity of human 293 cells in vivo. 2 x 10* cells were 
cultured for 8 hours in the absence, "O", or in the presence of 10 nM trapoxin. Cells 
were harvested and crude extracts fi-om approximately 1 x lO'^ cells (solid bars) or anti- 

30 HDl immunoprecipitations of extracts fi^om approximately 4 x 10*^ cells (white bars) 
were assayed for histone deacetylase activity in vitro. 
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Detailed Description of the Invention 

The positioning of nudeosomes relative to particular regulatory elements in 
genomic DNA has emerged as a mechanism for managing the association of sequence- 
specdic DNA-binding proteins with promoters, enhancers and other transcriptional 
5 regulatory sequences. Two modifications to nudeosomes have been observed to 
influence the association of DNA-binding proteins with chromatin. Depletion of histones 
H2AmB from the nudeosome facilitates the binding of RNA polymerase H (Baer et al 
(1983) Nature 301:482-488) and TFIIIA (Hayes et al. (1992) PNAS 89:1229-1233) 
Likewise, acetylation of Uie core histones apparentiy destabUizes the nudeosome and is 
10 thought to modulate the accessibility of transcription factors to their respective enhancer 
and promoter elements (OUva et al. (1990) Nuc Acid Res 18:2739-2747- and Walker et 
al. (1990) J Biol Chem 265:5622-5746). In both cases, overall histone^DNA contacts 
are altered. 

In one aspect, tiie present invention concerns the discovery of a family of genes in 
15 mammals, the gene products of which are referred to herein as "histone deacetylases" or 
"HDxfs". Experimental evidence indicates a fiinctional role for the HDx gene products as 
catalysts of the deacetylation of histones in mammalian ceUs, and accordingly play a role 
in determining tissue fete and maimenance. For instance, the results provided below 
mdrcate tiiat proteins encoded by the HDx genes may partidpate. under various 
20 circumstances, in the control of proliferation, differentiation and cell death. 

The family of gene apparentiy encode at least tiiree different sub-famiUes. 
e.g.. paralogs, and have been identified from tiie cells of various mammals. The HDx 
proteins were first isolated from bovine thymus nucld by use of a binding assay which 
exploited the ability of trapoxin, an inhibitor of histone deacetylase activity, to isolate 
25 proteins which co-purified wiU a histone acetylase activity. The partial identity of tiie 
isolated proteins were determined by peptide microsequencing. and primers based on the 
peptide sequences were used to done human cDNAs from a T cdl Ubraiy. One of tiie 
HDx gene products described below is referred to herein as HDl, and is represented in 
SEQ ID No. 1 (nudeotide) and SEQ ID No. 5 (amino add). 

30 A search of expressed sequence tag (EST) libraries turned up partial sequences 

for human HDx transcripts, and revealed tiie existence of at least two otiier human HDx 
genes related to HDI, these otiier paralogs referred to herein as HD2 and HD3 
Nudeot.de and amino acid sequences for partial dones of other human HDx homologs 
are provided by SEQ ID Nos. 2A and 6-8. respectively 
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Analysis of the HDx sequences indicated no obvious similarities with any 
previously identified domains or motifs. However, the feet that each full-length clone 
lacks a signal sequence, along with the observation that proteins can be detected in the 
nucleus, indicates that the HDx genes encode intracellular proteins. 

5 Careful inspection of the HDx clones suggests at least two novel motifs, one or 

both of which may be characteristic of at least subfamilies of the mammalian HDx family. 
The first apparently conserved structural element of the HDx family occurs in the N- 
tcrminal portion of the molecule, and is designated herein as the "v motif. With 
reference to human HDl^ the v motif corresponds to amino acid residues Aspl30- 

10 Phel98. By alignment of the human HDx sequences, the element is represented by the 
consensus sequence: 

DXXXNXXGGIilHAKKXEASGFCYXNDIVXXIXELl^^ 
VMTXSF, (SEQ ID No. 12) 

more preferably by the consensus sequence: 

1 5 DIAXiNWAGGIlfiiAKKX2EASGFCYVh©IVX3X4lLEL^ 
7TD.RVMTVSF (SEQ ro No. 13) 

wherein each of represents any single amino acid, though more preferably represents 
an amino acid residue in the corresponding human HDx sequences of the appended 
sequence listing. 

20 A second motif, herein designated the % motif is represented by the consensus 

sequence: 

CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET (SEQ ID No, 14) 
more preferably by the consensus sequence: 

CVEXgVKX9FNXi(>PIJJCnLGGGGYTXi2RNVARCWTYET (SEQ ID No: 15) 

25 wherein each of Xn represents any single amino acid, though more preferably represents 
an amino add residue in the corresponding human HDx sequences of the appended 
sequence listing. The x motif can be found in the human HDl sequence at C284-Thr3 1 6. 

The femily of HDx proteins apparently ranges in size fi*om about 40kd to about 
60kd for the unmodified polypeptide chain. For instance, the bovine HDJ protein 
30 migrates on an SDS-PAGE (9%) gel with an apparent molecular weight of 46kD. The 
human HDJ anuno acid sequence predicts a molecular weight for the polypeptide chain 
ofSSkD. 
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Accordingly, certain aspects of the present invention relate to nucleic acid's 
encoding HDx proteins, the HDx proteins themselves, antibodies immunoreactive with 
HDx proteins, and preparations of such compositions. Moreover, the present invention 
provides diagnostic and therapeutic assays and reagents for detecting and treating 
disorders involving, for example, aberrant expression (or loss tiiereof) of HDx homologs. 
In addition, drug discovery assays are provided for identifying agents which can modulate 
the biological function of HDx proteins, such as by altering the binding of HDx molecules 
to either proteins or nucleic acids. Such agents can be usefiU therapeuticaUy to alter tiie 
growth and/or diflferentiation of a ceU. Otiier aspects of tiie invention are described 
below or will be apparent to those skflled in the art in light of the present disclosure. 

Analysis of the human HDx sequences. whUe not revealing any obvious 
similarities to known domains or motifs, did indicate similarities witfi previously identified 
proteins from both Saccharomyces cerevisiae and Xenopus laevis. Those genes, RPD3 
(SEQ ID No. 9) and Xe-RPD3 (SEQ ID No. 10). respectively, had not previously been 
15 ascribed any specific function. However, based on our observations for the function of 
HDl, it is now apparent that each of these otiter proteins are also deacetylases, and 
represent potential tiierapeutic targets. Accordingly, drug discoveiy assays are proiaded 
for identifying agents which can modulate tiie biological fimction of "//Z^tc-related" 
proteins, such as RPD3 homologs, by altering the enzymatic activity of the deacetylase, 
or its binding to otiier cellular components including homologs of RbAp48 (described 
infra). Such agents can be useful therapeutically to alter the growtii and/or differentiation 
of non-human ceUs, such as in the treatment of mycotic mfections, or as additives to 
livestock feed, e.g., to promote weight gain, or as topical antiseptics for sterilizing 
medical equipment. 

In addition we isolated another bovine protein having an approximate molecular 
size of 50kD which apparently binds HDx proteins isolated by the trapoxin matrix, and 
microsequencing of tiiat protein demonstrated tfiat it was related to tiie protein referred 
to in tiie art as RbAp48 (Qian et al. (1993) Nature 364:648; SEQ ID No. 1 1). RbAp48 
was originally identified as a protein tiiat binds to tiie retinoblastoma (Rb) gene product. 
The retinoblastoma (RB) gene product plays a role in tumor suppression (Weinberg, 
R.A., (Sept 1988) Scientific Amer.^^ 44-51; Hansen et al. (1988) Trends Genet 4.125- 
128). The role of RB as a himor-suppressor protein in cell-cycle control is believed to be 
similar to tiiat of another tumor-suppressor, p53 (Green (1989) CeU 56:1-3; Mowat et al 
(1985 Nature 314:633-636). Inactivation or muUtion of the second RB allele in one of 
tiie somatic cells of these susceptible individuals appears to be the molecular ev nt tiiat 
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leads to tumor formation (Caveneee et al. (1983) Nature 305:799-784; Friend et al. 
(1987) PNAS 84:9059-9063). 

The growth suppression function of the retinoblastoma protein is thought to be 
mediated by Rb binding to cellular proteins. RbAp48 is one of the major proteins that 

5 binds to a putative functional domain at the carboxy terminus of the Rb protein. 
Complex formation between RbAp48 and Rb occurs in vitro and in vivo, and apparently 
involves direct interaction between the proteins. Like Rb, RbAp48 is a ubiquitously 
expressed nuclear protein. RbAp48 share sequence homology with MSU, a negative 
regulator of the Ras-Qrclic AMP pathway in the yeast Saccharomyces cerevisiae. 

10 Furthermore, like Aff//, human RbAp48 suppresses the heat-shock sensitivity of the 
yeast iraJ strains and RAS2VaU9 strains. Interaction with RbAp48 may be one of the 
mechanisms for suppres^on of growth mediated by Rb. Accordingly, the interaction of 
RbAp48 with HDx proteins fiirther implicates the HDx proteins in cell-cycle regulation. 

The RpAp48 interaction with HDx and /«>r-related proteins rq)rescnts yet 
15 another therapeutic target. Accordingly, drug discovery assays are provided for 
identifying agents which can modulate the interaction of RbAp48 proteins and the like 
with ^Zfcc-related proteins. Such assays can be derived to detect the ability of a test 
agent to alter protein-protein contacts, or to alter the enzymatic activity of the 
deacetylase in complexes including an RbAp48 protdn (e.g., were such complexes 
20 allosterically modulate the HDx enzymatic activity). As above, such agents can be useful 
therapeutically to alter the growth and/or differentiation of cells. 

Members of the Mad family of BHLHZip proteins heterodimeiize with Max to 
repress transcription in a sequence-specific manner. Transcriptional repression by 
Mad:Max heterodimers is mediated by ternary complex formation wth either of the 

25 corepressors mSin3A or mSmSB. Example 3 demonstrates that Sin3 protons are an in 
vivo component of large, heterogeneous multiprotein complexes and is tightly and 
specifically associated with at least seven polypeptides. Two of the Sin3-associated 
proteins, p50 and p55, are members of the histone deacetylase family described herdn. 
Sin3 immunecomplexes possess histone deacetylase activity that is senritive to the 

30 specific deacetylase inhibitor trapoxin. Sin3 targeted repression of a reporter gene is 
reduced by trapoxin treatment, suggesting that histone deacetylation mediates 
transcriptional repression through Mad-Max-Sin3 A multimeric complexes. 

The Sin3 interaction vath HDx and M>x-related proteins represents still another 
therapeutic target. Thus, in one aspect of the present invention there is provided drug 
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discovery assays for identifying agents which can modulate the interaction of Sin3 
proteins and the like with fiCbr-related proteins. Such assays can be derived to detect the 
ability of a test agent to alter protem-protein contacts, or to alter the enzymatic activity 
of the deacetylase in complexes including Sin3 or other transcriptional regulatory 
5 proteins. As above, such agents can be useful therapeutically to alter the growth and/or 
diflferentiation of cells. 

For convenience, certain terms employed in the specification, examples, and ~ 
appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
10 deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The 
term should also be understood to include, as equivalents, analogs of either RNA or 
DNA made fi-om nucleotide analogs, and, as applicable to the embodiment being 
described, single (sense or antisensc) and double-stranded polynucleotides. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
15 comprising an open reading fi^e encodmg one of the HDx polypeptides of the present 
invention, including both exon and (optionally) intron sequences. A "recombinant gene" 
refers to nucleic acid encoding an HDx polypeptide and comprising /ffiir-encoding exon 
sequences, though it may optionally include intron sequences which are either derived 
from a chromosomal HDx gene or from an unrelated chromosomal gene. Exemplary 
20 recombinant genes encoding the subject HDx polypeptide are represented in the 
appended Sequence Listing. The term "intron" refers to a DNA sequence present in a 
given HDx gene which is not translated into protein and is generally found between 
exons. 

As used herein, the terra "transfection" means the introduction of a nucleic acid, 
25 e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 
"Transformation", as used herein, refers to a process in which a cellos genotype is 
changed as a result of the cellular uptake of exogenous DNA or RNA, and, for example, 
the transformed cell expresses a recombinant fonn of m HDx polypeptide or, where anti- 
sense expression occurs from the transferred gene, the expression of a naturally- 
30 occurring fonn of the HDx protein is disrupted. 

As used herein, the term "spedficaUy hybridizes" refers to the ability of the 
probe/primer of the invention to hybridize to at least 15 consecutive nucleotides of an 
HDx gene, such as an HDx sequence designated in one of SEQ ID Nos: 1-4, or a 
sequence complementary thereto, or naturally occurring mutants thereof, such that it has 
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less than 15%, preferably less than 10%, and more preferably less than 5% background 
hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) encoding a protein 
other than an HDx protein, as defined herein. In preferred embodiments, the 
oligonucleotide probe specifically detects only one of the subject HDx paralogs, e.g.. 
5 does not substantially hybridize to transcripts for other HDx homologs in the same 
species. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of preferred 
vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. 

10 Preferred vectors are those capable of autonomous replication and/expression of nucleic 
acids to which they are linked. Vectors capable of directing the expression of genes to 
which they are operatively linked are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
fonn of "plasmids" which refer generally to circular double stranded DNA loops which, 

15 in their vector form are not bound to the chromosome. In the present specification, 
"plasmid" and "vector" are used interchangeably as the plasmid is the most commonly 
used form of vector. However, the invention is intended to include such other forms of 
expression vectors which serve equivalent functions and which become known in the art 
subsequently hereto. 

20 "Transcriptional regulatory sequence** is a generic term used throughout the 

specification to refer to DNA sequences, such as initiation signals, enhancers, and 
promoters, which induce or control transcription of protein coding sequences with which 
they are operably linked. In preferred embodiments, transcription of one of the 
recombinant HDx genes is under the control of a promoter sequence (or other 

25 transcriptional regulatory sequence) which controls the expression of the recombinant 
gene in a cell-type in which expression is intended. It will also be understood that the 
recombinant gene can be under the control of transcriptional regulatory sequences which 
are the same or which arc different fi-ora those sequences which control transcription of 
the naturally-occurring forms of HDx genes. 

30 As used herein, the term "tissue-specific promoter" means a DNA sequence that 

serves as a promoter, i.e., regulates e^q)ression of a selected DNA sequence operably 
linked to the promoter, and which eflfects expression of the selected DNA sequence in 
specific cells of a tissue, such as cells of hepatic, pancreatic, neuronal or hematopoietic 
origin. The term also covers so-called "leaky" promoters, which regulate expression of a 
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seleaed DNA primarily in one tissue, but can cause at least low level expression in other 
tissues as well. 

As used herein, a "transgenic animal" is any animal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
5 heterologous nucleic acid introduced by way of human intervention, such as by transgenic 
techniques well known in the art. The nucleic acid is introduced into the ceU, dirertly or 
indirectly by introduction into a precursor of the cell, by way of deUberate genetic 
manipulation, such as by microinjection or by infection wiUi a recombinant vims. The 
term genetic manipulation does not include classical cross-breeding, or in vitro 

10 fertilization, but ratiier is directed to the introduction of a recombinant DNA molecule. 
This molecule may be integrated witiiin a chromosome, or it may be extrachromosomaUy 
repUcating DNA. In the typical transgenic animals described herein, the transgene causes 
cells to express a recombinant form of one of the HDx proteins, e.g. either agonistic or 
antagonistic fonns. However, transgenic animals in which the recombinant HDx gene is 

15 silent are also contemplated, as for example, the FLP or CRE recombinase dependent 
constructs described below. - Moreover, "transgenic animal" also includes tiiose 
recombinant animals in which gene disruption of one or more HDx genes is caused by 
human intervention, including both recombination and antisense techniques. 

The "non-human animals" of the invention include vertdiratcs such as rodents, 
20 non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc. Preferred non- 
human animals are selected from the rodent family including rat and mouse, most 
preferably mouse, tiiough transgenic amphibians, such as members of the Xenopus genus, 
and transgenic chickens can also provide important tools for understanding and 
identifying agents which can afifea, for example, erabryogenesis and tissue foiroation. 
25 The invention also contemplates transgenic insects, including those of the genus 
Drosophila, such as D. melanogaster. The term "chimeric animal" is used herein to refer 
to animals in which the recombinant gene is found, or in which the recombinant is 
expressed in some but not all cells of the animal. The term "tissue-spedfic chimeric 
animal" indicates that one of the recombinant HDx genes is present and/or expressed or 
30 disrupted in some tissues but not others. 

As used herein, the term "ti^gene" means a nucleic acid sequence (encoding, 
e.g., one of tiie HDx polypeptides, or pending an antisense transcript tiiereto), which k 
partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it 
is introduced, or, is homologous to an endogenous gene of tiie transgenic animal or ceU 
35 into which it is introduced, but which is designed to be inserted, or is inserted, into the 
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animal's genome in such a way as to alter the genome of the cell into which it is inserted 
(e.g., it is inserted at a location which differs from that of the natural gene or its insertion 
results in a knockout). A transgene can include one or more transcriptional regulatory 
sequences and any other nucleic acid, such as introns, that may be necessary for optimal 
5 expression of a selected nucleic acid. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such"duplicate genes may be identical 
or may have certain modifications, including nucleotide substitutions, additions or 
deletions, which all still code for polypeptides having substantially the same activity. The 
10 term "DNA sequence encoding an HDx polypeptide" may thus refer to one or more 
genes within a particular individual. Moreover, certain differences in nucleotide 
sequences may exist between individuals of the same species, whidi are called alleles. 
Such allelic differences may or may not result in differences in amino acid sequence of the 
encoded polypeptide yet still encode a protein with the same biolo^cai activity. 

15 "Homology" refers to sequence similarity between two peptides or between two 

nucleic add molecules. Homology can be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
homologous at that position. A degree of homology between sequences is a function of 

20 the number of matching or homologous positions shared by the sequences. An 
"unrelated" or "non-homologous" sequence shares less than 40 percent identity, though 
preferably less than 25 percent identity, with one of the HDx sequences of the present 
invention. 

As used herein, an "/fDx-related" protein refers to the HDx proteins described 
25 herem, and other human homologs of those HDx sequences, as well as orthologs and 
paralogs (homologs) of the HDx proteins in other species, ranging from yeast to other 
mammals, e.g., homologous histone deacetylase, The term "ortholog" refers to genes or 
proteins which are homologs via speciation, e.g., closely related and assumed to have 
conunon descent based on structural and functional considerations. Orthologous proteins 
30 function as recognizably the same activity in different species. The term "paralog" refers 
to genes or proteins which are homologs via gene duplication, e.g., duplicated variants of 
a gene within a genome. See also, Fritch, WM (1970) Syst Zool 19:99-113. 

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
herein. It is understood that such terms refer not only to the particular subject cell but to 
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35 



the progeny or potential progeny of such a cell. Because certain modifications may occur 
m succeeding generations due to either mutation or environmental influences, such 
progeny may not, in fact, be identical to the parent cell, but are still included within the 
scope of the term as used herdn. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
encoding one of the subject HDx polypeptides with a second amino acid sequence 
defining a domain (e.g. polypeptide portion) foreign to affd^not substantially homologous 
with any domain of one of the HDx proteins. A chimeric protem may present a foreign 
domain which is found (albeit in a diflferent protein) in an organism which also expresses 
the first protein, or it may be an "interspecies", "imergenic", etc. fiision of protein 
stnictures expressed by diflferent kinds of organisms. In general, a fiision protein can be 
represented by the general formula X-HDx-Y, wherein HDx represents a portion of the 
protein which is derived from one of die HDx proteins, and X and Y are, independentiy, 
absent or represent amino acid sequences which are not related to one of the HDx 
15 sequences in an organism. 

The term "isolated" as also used herein with respect to nucleic acids, such as 
DNA or RNA, refers to molecules separated from otiier DNAs, or KNAs, respectively, 
that are present in tiie natural source of the macromolecule. For example, an isolated 
nucleic acid encoding one of tiie subject HDx polypeptides preferably includes no more 
than 10 kilobases (kb) of nucleic acid sequence which naturally immediately flanks tiie 
HDx gene in genomic DNA, more preferably no more tiian 5kb of such naturally 
occurring flanking sequences, and most preferably less tiian 1.5kb of such naturally 
occuning flanking sequence. The term isolated as used herein also refers to a nucleic 
add or peptide tiiat is substantially fiee of ceUdar material, viral material, or culture 
medium when produced by recombinant DNA techniques, or chemical pi^ors or 
otiier chemicals when chemicaUy syntiiesized. Moreover, an "isolated nucleic acid" is 
meant to include nucleic acid fragments which are not naturally occuning as fragments 
and would not be found in the natural state. 

As described below, one aspect of tiie invention pertains to isolated nucleic acids 
comprising nucleotide sequences encoding HDx polypeptides, and/or equivalents of such 
nucleic acids. The term nucleic acid as used herein is intended to include fragments as 
equivalents. The term equivalent is understood to include nucleotide sequences encoding 
fimctionally equivalent HDx polypeptides or fiinctionally equivalent peptides having an 
activity oim HDx protein such as described herein. Equivalent nucleotide sequences wiU 
include sequences that differ by one or more nucleotide substitutions, additions or 



20 



25 



30 



wo 97/35990 



PCT/US97/05275 



-24- 

deletions, such as allelic variants; and will, therefore, include sequences that differ from 
the nucleotide sequence of the HDx cDNA sequences shown in any of SEQ ID Nos:l-4 
due to the degeneracy of the genetic code. Equivalents will also include nucleotide 
sequences that hybridize under stringent conditions (i.e., equivalent to about 20-27''C 
5 below the melting temperature (!„,) of the DNA duplex formed in about IM salt) to the 
nucleotide sequences represented in one or more of SEQ ID Nos:l-4. In one 
embodiment, equivalents will further include nudeic acid sequences derived from and 
evolutionarily related to, a nucleotide sequences shown in any of SEQ ID Nos:l-4. 

Moreover, it will be generally appreciated that, under certain drcumstances, it 
10 may be advantageous to provide homologs of one of the subject HDx polypqytides which 
ilmction in a limited capacity as one of other an HDx agonist (mimetic) or an HDx 
antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protdn. Thus, specific biological eflFects can be elicited 
by treatment with a homolog of limited fiinction, and with fewer side effects relative to 
15 treatment with agonists or antagonists which are directed to all of the biological activities 
of naturally occurring forms of HDx proteins. 

Homologs of each of the subject /ffisc proteins can be generated by mutagenesis, 
such as by discrete point mutation(s), or by truncation. For instance, mutation can give 
rise to homologs which retain substantially the same, or merely a subset, of the biological 
20 activity of the HDx polypeptide from which it was derived. Alternatively, antogonistic 
forms of the protein can be generated which are able to inhibit the function of the 
naturally occurring form of the protein, such as by competitively bmding to an HDx 
substrate or HDx associated protein, as for example competing wth wild-type HDx in the 
binding of RbAp48 or a histone. In addition, agonistic forms of the protein may be 
25 generated v^ch are constitutively active, or have an altered K^at or Km for deacetylation 
reactions. Thus, the HDx protein and homologs thereof provided by the subject 
invention may be either positive or negative regulators of transcription and/or replication. 

In general, polypeptides referred to herein as having an activity of an HDx protein 
(e.g., are "bioactive") are defined as polypeptides which include an amino acid sequence 
30 coire^onding (e.g., identical or homologous) to all or a portion of the amino add 
sequences of an /H3r proteins shown in any one or more of SEQ ID Nos:5-8 and which 
mimic or antagonize all or a portion of the biological/biochemical activities of a naturally 
ccuning HDx protdn. Examples of such biological activity include the ability to 
modulate proliferation of cdls. For example, inhibiting histone deacetylation causes cells 
35 to arrest in Gl and G2 phases of the cell cyde. The biochemical activity assodated with 
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HDx proteins of the pr sent invention can also diaracterized in terms of binding to and 
(optionally) catalyzing the deacetylation of an acetylated histone. Another biochemical 
property of certain of the subject HDx proteins involves binding to other cellular 
proteins, such as RbAp48 or Sin3A. 

5 Other biological activities of the subject HDx proteins are described herein or will 

be reasonably apparent to those skilled in the art. According to the present invention, a 
polypeptide has biological activity ifit is a spedfic agonist or antagonist of a naturally- 
occurring form of an HDx protein. 

Preferred nucleic acids encode an HDx polypeptide comprising an amino acid 
10 sequence at least 80% homologous, more preferably at least 85% homologous and most 
preferably at least 88% homologous with an amino acid sequence of a human HDx, e.g., 
such as selected from the group consisting of SEQ ID Nos: 5-8. Nucleic acids which 
encode polypeptides at least about 90%, more preferably at least about 95%, and most 
preferably at least about 98-99% homology with an amino acid sequence represented in 
15 one of SEQ ID Nos:5-8 are of course also within the scope of the invention, as are 
nucleic acids identical in sequence with any of the enumerated HDx sequences of the 
sequence listing. In one embodiment, the nucleic acid is a cDNA encoding a polypeptide 
having at least one activity of the subject HDx polypeptide. 

In certain preferred embodiments, the invention features a purified or recombinant 
20 HDx polypeptide having peptide chain with a molecular weight in the range of 40kd to 
60kd, even more preferably in the range of 45-50 kd or 53-58kd. It will be understood 
that certain post-translational modifications, e.g., phosphorylation and the like, can 
increase the apparent molecular weight of the HDx protein relative to the unmodified 
polypeptide chain, and cleavage of certain sequences, such as pro-sequences, can likewise 
25 decrease the apparent molecular weight. 

In other preferred embodiments, the nucleic acid encodes an HDx polypeptide 
which includes both the v and x motifs, and preferably possess a histone deacetylase 
activity. For example, preferred HDx proteins are represented by the general formula A- 
(v motif)-B-(x motif)-C, wherein the v motif is an amino add sequence represented in 
30 SEQ ID No. 12, more preferably SEQ ID No. 13, the % motif is an amino acid sequence 
represented in SEQ ID No, 14, more preferably SEQ ID No, 15, and A, B and C 
represent amino acid sequences which are correspond to HDx or /^Dx-related proteins. 

Still other preferred nucleic acids of the present invention encode an HDx 
polypeptide which includes a polypeptide sequence corresponding to all or a portion of 
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amino acid residues of any one of SEQ ID Nos: 5-8, e.g., at least 5, 10, 25, 50 or 100 
amino acid residues of that region. 

Another aspect of the invention provides a nucleic acid which hybridizes under 
high or low stringency conditions to the nucleic acid represented by SEQ ID No: 1. 
5 Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x 
sodium chloride/sodium citrate (SSC) at about 45^*0, followed by a wash of 2.0 x SSC at 
SO^'C, are known to tihiosie skilled in the art or can be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt 
concentration in the wash step can be selected from a low stringency of about 2.0 x SSC 
10 at 50°C to a high stringency of about 0.2 x SSC at 50^C. In addition, the temperature in 
the wash step can be increased from low stringency conditions at room temperature, 
about 22'*C, to high stringency conditions at about 65**C. 

Nucleic acids, having a sequence that differs from the nucleotide sequences 
shown in one of SEQ ID Nos: 1-4 due to degeneracy in the genetic code are also within 

15 the scope of the invention. Such nucleic acids encode functionally equivalent peptides 
(i.e., a peptide having a biological activity of an HDx polypeptide) but differ in sequence 
from the sequence shown in the sequence listing due to degeneracy in the genetic code. 
For example, a number of amino acids are designated by more than one triplet, Codons 
that specify the same amino acid, or synonyms (for example, CAU and CAC each encode 

20 histtdine) may result in "silent" mutations which do not affect the amino acid sequence of 
an HDx polypeptide. However, it is expected that DNA sequence polymorphisms that do 
lead to changes in the amino acid sequences of the subject HDx polypeptides will exist 
among, for example, humans. One skilled in the art will appreciate that these variations 
in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids 

25 encoding polypeptides having an activity of an HDx polypeptide may exist among 
individuals of a given species due to natural allelic variation. 

As used herein, an HDx gene fragment refers to a nucleic acid having fewer 
nucleotides than the nudeotide sequence encoding the entire mature form of an HDx 
protein yet which (preferably) encodes a poi)9eptide which retains some biological 
30 activity of the fiiU length protdn. Fragment sizes contemplated by the present invention 
include, for example, 5, 10, 25, 50, 75, 100, or 200 amino acids in length. 

As mdicated by the examples set out below, HDx protein-encoding nucleic acids 
can be obtained from mRNA present in any of a number of eukaryotic cells. It should 
also be possible to obtain nucleic acids encoding HDx polypeptides of the present 



wo 97/35990 



PCT/US97/05275 



-27- 



invention from genomic DNA from both adults and embtyos. For example, a gene 
encodmg an HDx protein can be cloned from either a cDNA or a genomic library in 
accordance with protocols described herein, as well as those generally known to persons 
skilled m the art. A cDNA encoding an HDx protem can be obtained by isolating total 
mRNA from a ceU, e.g. a mammalian ceU. e.g. a human cell, including embryonic cells 
Double stranded cDNAs can then be prepared from the total mRNA, and subsequently 
mserted mto a suitable plasmid or bacteriophage vector using any one of a number of 
known techniques. The gene encoding an HDx protein can also be cloned using 
estabhshed polymerase chain reaction techniques in accordance with the nucleotide 
10 sequence infonnation provided by the invention. The nucleic acid of the invention can be 
DNA or RNA A preferred nucleic acid is a cDNA including a nucleotide sequence 
represented by one of SEQ ID Nos: 1-4. 

Another aspect of the invention relates to the use of the isolated nucleic acid in 
"antisense" therapy. As used herein, "antisense" therapy refers to administration or m 

15 situ generation of oligonucleotide probes or their derivatives which specifically hybridize 
(e.g. bmds) under cellular conditions, witii the ceUular mRNA and/or genomic DNA 
encodmg one or more of tiie subject HDx proteins so as to inhibit expression of that 
protein, e.g. by inhibiting transcription and/or translation. The binding may be by 
conventional base pair complementarity, or. for example, in tiie case of binding to DNA 

20 duplexes, Uirough specific interactions in tiie major groove of the double heUx In 
general, "antisense" therapy refer, to the range of techniques generally employed in the 
art, and includes any tiierapy which relies on specific binding to oligonucleotide 
sequences. 

An antisense constiuct of the present invention can be delivered, for example as 
25 an expression plasmid which, when transcribed in tiie cell, produces RNA which is 
complementaor to at least a unique portion of tiie ceUuIar mRNA which encodes an HDx 
protem. Alternatively, the antisense constiuct is an oUgonucIeotide probe which is 
generated ex vivo and which, when introduced into tiie ceU causes inhibition of 
expression by hybridizing witii tiie mRNA and/or genomic sequences of an HDx gene 
30 Such OUgonucIeotide probes are preferably modified oligonucleotides which are resistant 
to endogenous nucleases, e.g. exonudeases and/or endonudeases, and are therefore 
sUble m vivo. Exemplaiy nucleic acid molecules for use as antisense oUgonucleotides are 
phosphoramidate. phosphothioate and methylphosphonate analogs of DNA (see also US 

Addiuonally. general approaches to constnicting oUgomers useful in antisense therapy 
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have been reviewed, for example, by Van der Krol et al. (1988) Biotechniques 6:958- 
976; and Stein et al. (1988) Cancer Res 48:2659-2668, 

Accordingly, the niodified oligomers of the invention are useful in therapeutic, 
diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized 

5 in a manner appropriate for antisense therapy in general. For such therapy, the oligomers 
of the invention can be formulated for a variety of routes of administration, induding 
systemic and topical or localized administration. Techniques and formulations generally 
may be found in Remmingtprfs Pharmaceutical Sciences, Meade Publishing Co., Easton, 
PA. For systemic administration, injection is preferred, including intramuscular, 

10 intravenous, intraperitoneal, and subcutaneous. For injection, the oligomers of the 
invention can be formulated in liquid solutions, preferably in physiologically compatible 
buffers such as Hank's solution or Ringer's solution. In addition, the oligomers may be 
formulated in solid form and redissolved or suspended inunediately prior to use. 
Lyophilized forms are also included. 

15 Systemic administration can also be by transmucosal or transdermal means, or the 

compounds can be administered orally. For transmucosal or transdermal admmistration, 
penetrants appropriate to the barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art, and include, for example, for transmucosal 
administration bile salts and fiisidic acid derivatives. In addition, detergents may be used 

20 to facilitate permeation. Transmucosal administration may be through nasal sprays or 
using suppositories. For oral administration, the oligomers are formulated into 
conventional oral administration forms such as capsules, tablets, and tonics. For topical 
administration, the oligomers of the invention are formulated into ointments, salves, gels, 
or creams as generally known in the art. 

25 In addition to use in therapy, the oligomers of the invention may be used as 

diagnostic reagents to detect the presence or absence of the target DNA or RNA 
sequences to which they specifically bind. Such diagnostic tests are desaibed in further 
detail below. 

Likewise, the antisense constructs of the present invention, by antagonimig the 
30 normal biolo^cal activity of one of the HDx proteins, can be used in the manipulation of 
tissue, e.g. tissue differentiation or growth, both in vivo and ex vivo. 

Furthermore, tiie anti-sense techniques (e.g. microinjection of antisense 
molecules, or transfection with plasmids whose transcripts are anti-sense with regard to 
an HDx mRNA or gene sequence) can be used to investigate role of HDx in 
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developmental events, as weU as the normal cellular function of HDx in adult tissue. 
Such techniques can be utilized in cell culture, but can also be used in the creation of 
transgenic animals (described infra). 

This invention also provides expression vectors containing a nucleic acid 
5 encoding an HDx polypeptide, operably linked to at least one transcriptional regulatory 
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a 
regulatory sequence in a manner which allows expression of the nucleotide sequence. 
Regulatory sequences are art-recognized and are selected to direct expression of the 
subject HDx proteins. Accordingly, the term transcriptional r«?gulatory sequence includes 
10 promoters, enhancers and other expression control elements. Such regulatory sequences 
are described in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press. San Diego. CA (1990). For instance, any of a wide variety of 
expression control sequences, sequences that control the expression of a DNA sequence 
when operatively linked to it, may be used in these vectors to express DNA sequences 
15 encoding HDx polypeptides of this invention. Such useful expression control sequences, 
include, for example, a viral LTR, such as the LTR of the Moloney murine leukemia 
virus, the early and late promoters of SV40. adenoviriis or cytomegalovirus immediate 
early promoter, the lac system, the trp system, the TAC or TRC system, T7 promoter 
whose expression is directed by T7 RNA polymerase, the major operator and promoter 
20 regions of phage X, the control regions for fd coat protein, the promoter for 3- 
phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase. 
e.g., Pho5. the promoters of the yeast a-mating factors, the polyhedron promoter of the 
baculovirus system and other sequences known to control the expression of genes of 
prokaiyotic or eukaiyotic cells or their viruses, and various combinations thereof It 
25 should be understood that the design of the expression vector may depend on such 
factors as the choice of the host cell to be transformed and/or the type of protein desired 
to be expressed. Moreover, the vector's copy number, the ability to control that copy 
number and the expression of any other proteins encoded by the vector, such as antibiotic 
maricers. should also be considered. In one embodiment, the expression vector inchides a 
30 recombinant gene encoding a peptide having an agonistic activity of a subject HDx 
polypeptide, or alternatively, encoding a peptide which is an antagonistic form of the 
HDx protein, such as a catalytically-inactive deacetylase. Such expression vectors can be 
used to transfect ceUs and thereby produce polypeptides, including fusion proteins, 
encoded by nucleic acids as described herein. 
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Moreover, the gene constructs of the present invention can also be used as a part 
of a gene therapy protocol to deliver nucleic acids, e.g., encoding either an agonistic or 
antagonistic form of one of the subject HDx proteins or an antisense molecule described 
above. Thus, another aspect of the invention features expression vectors for in vivo or in 
5 vitro transfection and expression of an HDx polypeptide or antisense molecule in 
particular cell types so as to reconstitute the function of, or alternatively, abrogate the 
function of /fZ>x-^induced transcription in a tissue in which the naturally-occurring form of 
the protein is misexpressed; or to deliver a form of the protein which alters differentiation 
of tissue, or which inhibits neoplastic transformation. 

10 Expression constructs of the subject HDx polypeptides, as well as antisense 

constructs, may be administered in any biologically effective carrier, e.g. any formulation 
or composition capable of effectively delivering the recombinant gene to cells in vivo. 
Approaches include insertion of the subject gene in viral vectors including recombinant 
retro\iruses, adenovirus, adeno-assodated virus, and herpes simplex virus- 1, or 

IS recombinant bacterial or eukaryotic plasmids. Viral vectors transfect cells directly; 
plasmid DNA can be delivered with the help of, for example, cationic liposomes 
(lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin 
S, artificial viral envelopes or other such intracellular carriers, as well as direct injection 
of the gene construct or CaP04 precipitation carried out in vivo. It will be appreciated 

20 that because transduction of appropriate target cells represents the critical first step in 
gene therapy, choice of the particular gene delivery system will depend on such factors as 
the phenotype of the intended target and the route of administration, e.g. locally or 
systemically. Furthermore, it will be recognized that the particular gene construct 
provided for in vivo transduction of HDx expression are also usefiil for in vitro 

2S transduction of cells, such as for use in the ex vivo tissue culture systems described 
below. 

A preferred approach for in vivo introduction of nucleic acid into a cell is by use 
of a viral vector containing nucleic acid, e.g. a cDNA encoding the particular HDx 
polypeptide desired. Infection of cells with a viral vector has the advantage that a large 
30 proportion of the targeted cells can receive the nucleic acid. Additionally, molecules 
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are 
expressed eflficiently in cells which have taken up viral vector nucleic acid. Retrovirus 
vectors, adenovirus vectors and adeno-associated virus vectors are exemplary 
recombinant gene delivery system for the transfer of exogenous genes in vivo. 
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particularly into humans. These vectors provide eflScient delivery of genes into cells, and 
the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. 

In addition to viral transfer methods, such as those illustrated above, non-viral 
methods can also be employed to cause expression of a subject HDx polypeptide in the 
5 tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms 
used by mammalian cells for the uptake and intracellular transport of macromolecules. In 
preferred embodiments, non-viral gene delivery systems of the present invention rely on 
endocytic pathways for the uptake of the subject HDx polypeptide gene by the targeted 
cell. Exemplary gene delivery systems of this type include liposomal derived systems, 
10 poly-lysine conjugates, and artificial viral envelopes. 

In clinical settings, the gene delivery systems for the therapeutic HDx gene can be 
introduced into a patient by any of a number of methods, each of v^diich is femiliar in the 
art. For instance, a pharmaceutical preparation of the gene delivray system can be 
introduced systemically, e.g. by intravenous injection, and spedfic transduction of the 

15 protein in the target cells occurs predominantly fi^om specificity of transfection provided 
by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional 
regulatory sequences controlling expression of the receptor gene, or a combination 
thereof In other embodiments, initial delivery of the recombinant gene is more limited 
with introduction into the animal being quite localized. For example, the gene delivery 

20 vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by stereotactic 
injection (e.g. Chen et al. (1994) PNAS 91; 3054-3057). AN HDx gene, such as any one 
of the clones represented in the group consisting of SEQ ID NO: 1-4, can be delivered in 
a gene therapy construct by electroporation using techniques described, for example, by 
Dev et al. ((1994) Cancer Treat Rev 20:105-1 15). 

25 The pharmaceutical preparation of the gene therapy construct can consist 

essentially of the gene delivery system in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery system can be produced intact fi-om recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which 

30 produce the gene delivery system. 

Another aspect of the present invention concerns recombinant forms of the HDx 
proteins. Recombinant polypeptides preferred by the present invention, in addition to 
native HDx proteins, are at least 80% homologous, more preferably al least 85% 
homologous and most preferably at least 88% homologous with an amino acid sequence 



wo 97/35990 



PCT/US97/05275 



-32- 

represented by any of SEQ ID Nos: 5-8. Polypeptides which possess an activity of an 
HDx protem (i.e. either agonistic or antagonistic), and which are at least 90%, more 
preferably at least 95%, and most preferably at least about 98-99% homologous with a 
sequence selected from the group consisting of SEQ ID Nos: 5-8 are also withm the 
5 scope of the invention. In other preferred embodiments, the HDx polypeptide includes 
both the V and x motifs, and preferably possess a histone deacetyiase activity. 

The term "recombinant HDx protein" refers to a polypeptide which is produced 
by recombinant DNA techniques, wherein generally, DNA encoding an HDx polypeptide 
is inserted into a suitable expression vector which is in turn used to transform a host cell 

10 to produce the heterologous protein. Moreover, the phrase "derived from", with respect 
to a recombinant HDx gene, is meant to include within the meaning of "recombinant 
protein" those proteins having an amino add sequence of a native HDx protem, or an 
amino acid sequence similar thereto which is generated by mutations induding 
substitutions and ddetions (induding truncation) of a naturally occurring form of the 

15 protein. 

The present invention further pertains to recombinant forms of the subject HDx 
polypeptides whidi are encoded by genes derived from a mammal (e.g. a human), and 
which have amino acid sequences evolutionarily related to the HDx proteins represented 
in SEQ ID Nos: 5-8. Such recombinant HDx polypeptides prefi^bly are capable of 

20 functioning in one of either role of an agonist or antagonist of at least one biological 
activity of a wid-type ("authentic") HDx protein of the appended sequence listing. The 
term "evolutionarily related to", with respect to amino acid sequences of HDx proteins, 
refers to both polypeptides having amino acid sequences which have arisen naturally, and 
also to mutational variants of HDx polypeptides which are derived, for example, by 

25 combinatorial mutagenesis. 

The present invention also provides methods of producing the subject HDx 
polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding the subject polypeptides can be cultured 
under appropriate conditions to allow expression of the peptide to occur. The cells may 

30 be harvested, lysed and the protdn isolated. A cell culture includes host cells, media and 
other byproducts. Suitable media for cell culture are well known in the art. The 
recombinant HEhc polypeptide can be isolated from cell culture medium, host cells, or 
both using techmques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 

35 inununoaffinity purification with antibodies specific for such peptide. In a preferred 
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embodiment, the recombinant HDx polypeptide is a fusion protein containing a domain 
which facilitates its purification, such as GST fusion protein or poIy(His) fusion protein. 

This invention also pertains to a host cell transfected to express recombinant 
forms of the subject HDx polypeptides. The host cell may be any prokaiyotic or 
5 eukaryotic cell. Thus, a nucleotide sequence derived from the cloning of HDx proteins, 
encoding all or a selected portion of a full-length protein, can be used to produce a 
recombinant form of an HDx polypeptide via microbial or eukaryotic cellular processesT 
Ligating the polynucleotide sequence into a gene constmct, such as an expression vector, 
and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or 
10 mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing 
other well-known proteins, e.g. MAP kinases, p53, WTl, PTP phosphatases, SRC, and 
the like. Similar procedures, or modifications thereof, can be employed to prepare 
recombinant HDx polypeptides by microbial means or tissue-culture technology in accord 
with the subject invention. 

1 5 The recombinant HDx genes can be produced by ligating nucleic acid encoding an 

HDx protein, or a portion thereof, into a vector suitable for expression in either 
prokaryotic cells, eukaryotic cells, or both. Expression vectors for production of 
recombinant forms of the subject HDx polypeptides include plasmids and other vectors. 
For instance, suitable vectors for the expression of an HDx polypeptide include plasmids 

20 of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived 
plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in 
prokaryotic ceUs, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in yeast. 
For instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are cloning and 

25 expression vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, 
for example, Broach et al. (1983) in Experimental Manipulation of Gene Expression, 
ed. M. Inouye Academic Press, p. 83, incorporated by reference herein). These vectors 
can replicate in E. coli due the presence of the pBR322 ori, and in S. cerevisiae due to 
the replication determinant of the yeast 2 micron plasmid. In addition, drug resistance 

30 markers such as ampicillin can be used. In an illustrative embodiment, an HDx 
polypeptide is produced recombinantly utilizing an expression vector generated by sub- 
cloning the coding sequence of one of the HDx genes represented in SEQ ID Nos: 1-4. 

The preferred mammalian expression vectors contain both prokaryotic sequences, 
to facilitate the propagation of the vector in bacteria, and one or more eukaryotic 
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transcription units that are expressed in eukaryotic cells. The pcDNAI/amp, 
pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-(lhfr, pTk2, pRSVneo, pMSG, 
pSVT7, pko-neo and pHyg derived vectors are examples of mammalian expression 
vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified 
S with sequences fi-om bacterial plasmids, such as pBR322, to facilitate replication and 
drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively, 
derivatives of viruses such as the bovine papillomavirus (BPV-I), or Epstieiii-Barr virus 
(pHEBo, pREP-derived and p205) can be used for transient expression of proteins in 
eukaryotic cells. The various methods ^ployed in the preparation of the plasmids and 
10 transformation of host organisms are well known in the art. For other suitable expression 
systems for both prokaryotic and eukaryotic cells, as well as general recombinant 
procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 and 17. 

In some instances, it nuy be desirable to express the recombinant HDx 
15 polypeptide by the use of a baculo virus expression system. Examples of such baculo virus 
expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and 
pVL941), pAcUW-derived vectors (such as pAcUWl), and pBlueBac-derived vectors 
(such as the Q-gal containing pBlueBac III). 

When it is desirable to express only a portion of an HDx protein, such as a form 
20 lacking a portion of the N-terminus, i.e. a truncation mutant which lacks the signal 
peptide, it may be necessary to add a start codon (ATG) to the oligonucleotide fragment 
containing the desired sequence to be expressed. It is well known in the art that a 
methionine at the N-terminal position can be enzymatically cleaved by the use of the 
enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli (Ben- 
25 Bassat et al. (1987) J. Bacteriol. 169:751-757) and Salmonella typhimurium and its in 
vitro activity has been demonstrated on recombinant proteins (Miller et al. (1987) PNAS 
84:2718-1722). Therefore, removal of an N-terminal methionine, if desired, can be 
achieved either in vivo by expressing /^Dx-derived polypeptides in a host which produces 
MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g., 
30 procedure of Miller et al., supra). 

Alternatively, the coding sequences for the polypeptide can be incorporated as a 
part of a fusion gene including a nucleotide sequence encoding a different polypeptide. 
This type of expression system can be useful under conditions where it is desirable to 
produce an immunogenic fragment of an HDx protein. For example, the VP6 capsid 
35 protein of rotavirus can be used as an immunologic carrier protein for portions of the 
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HDx polypeptide, either in the monomeric form or in the form of a viral particle. The 
nucleic acid sequences corresponding to the portion of a subject HDx protein to which 
antibodies are to be raised can be incorporated into a fusion gene construa which 
bcludes coding sequences for a late vaccinia virus stmctural protein to produce a set of 
recombinant viruses expressing fusion proteins comprising HDx epitopes as part of the 
virion. It has been demonstrated with the use of immunogenic fusion proteins utilizing 
the Hepatitis B surface antigen fusion proteins that recombinant Hepatitis B,^pns can 
be utilized in this role as well. Similariy, chimeric constiaicts coding fo"r fbSon proteins 
containing a portion of an HDx protein and the poliovirus capsid protein can be created 
to enhance immunogenicity of the set of polypeptide antigens (see. for example. EP 
PubUcation No: 0259149; and Evans et al. (1989) Nature 339:385; Huang et al. (1988) 
J. Virol. 62:3855; and Schlienger et al. (1992) J. Virol. 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can also be 
utilized to generate an immunogen, wherein a desired portion of an HDx polypeptide is 
15 obtained directly from organo-chemical synUiesis of the peptide onto an oligomeric 
branching lysine core (see. for example. Posnett et al. (1988) JBC 263:1719 and Nardelli 
et al. (1992) J. Immunol. 148:914). Antigenic determinants of HDx proteins can also be 
expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
appredated that fusion proteins can also facilitate tiie expression of proteins, and 
accordingly, can be used in the expression of the HDx polypeptides of the present 
invention. For example, HDx polypeptides can be generated as glutatiiione-S-transferase 
(GST-fiision) proteins. Such GST-fusion proteins can enable easy purification of the 
HDx polypeptide, as for example by tfie use of glutaUiione-derivatized matrices (see, for 
example. Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y • John Wiley 
& Sons. 1991)). 

In another embodiment, a fiision gene coding for a purification leader sequence, 
such as a poly-(Hisyenterokinase cleavage site sequence at tiie N-terminus of the desired 
portion of the recombinant protein, can allow purification of the expressed fusion protein 
by aflBnity chromatography using a Ni2+ metal resm. The purification leader sequence 
can then be subsequentiy removed by treatinent wiU. enterokinase to provide the purified 
protein (e.g.. see Hochuli et al. (1987) J. Chromatography 41 1:177; and Janknecht et al 
PNAS 88:8972). 
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Techniques for making fusion genes are known to those skilled in the art. 
Essentially, the joining of various DNA fragments coding for diflferent polypeptide 
sequences is performed in accordance with conventional techniques, employing blunt- 
ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
5 £^)propriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 
. the fusion gene can be synthesized by conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using 
anchor primers which give rise to complementary overhangs between two consecutive 
10 gene fragments which can subsequently be annealed to generate a chuneric gene sequence 
(see, for example. Current Protocols in Molecular Biology, eds. Ausubel et al. John 
Wiley & Sons: 1992). 

HDx polypeptides may also be chemically modified to create HDx derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
15 groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of HDx 
proteins can be prepared by linking the chemical moieties to functional groups on amino 
acid sidechains of the protein or at the N-terminus or at the C-terminus of the 
polypeptide. 

The present invention also makes available isolated HDx polypeptides which are 

20 isolated from, or otherwise substantially free of other pellular proteins, especially other 
signal transduction factors and/or transcription factors which may normally be associated 
with the HDx polypeptide. The term "substantially free of other cellular proteins" (also 
referred to herein as "contaminating proteins") or "substantially pure or purified 
preparations" are defined as encompassing preparations of HDx polypeptides having less 

25 than 20% (by dry wdght) contaminating protein, and preferably having less than 5% 
contaminating protein. Functional forms of the subject polypeptides can be prepared, for 
the first time, as purified preparations by using a cloned gene as described herein. By 
"purified", it is meant, when referring to a peptide or DNA or KNA sequence, that the 
indicated molecule is present in the substantial absence of other biological 

30 macromolecules, such as other proteins. The term "purified" as used herein preferably 
means at least 80% by diy weight, more preferably in the range of 95-99% by weight, 
and most preferably at least 99.8% by weight, of biological macromolecules of the same 
type present (but water, buffers, and other small molecules, especially molecules having a 
molecular weight of less than 5000, can be present). The term "pure" as used herein 

35 preferably has the same numerical limits as "purified" inunediatcly above. "Isolated" and 
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"purified" do not encompass either natural materials in their native state or natural 
materials that have been separated into components (e.g., in an acrylamide gel) but not 
obtained either as pure (e.g. lacking contaminating proteins, or chromatography reagents 
such as denaturing agents and polymers, e.g. acrylamide or agarose) substances or 
solutions. In preferred embodiments, purified HDx preparations will lack any 
contaminating proteins from the same animal from that HDx is normally produced, as can 
be accomplished by recombinant expression o^ for example, . .a . human HDx protein in a 
non-human cell. 

As described above for recombinant polypeptides, isolated HDx polypeptides can 
include all or a portion of an amino acid sequences corresponding to ixiHDx polypeptide 
represented in any one of SEQ ID Nos: 5-8 or homologous sequences thereto. In 
preferred embodiments, the HDx polypeptide includes both the v and x motifs, and 
preferably possess a histone deacetylase activity. 

Isolated peptidyl portions oiHDx proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid encoding 
such peptides. In addition, fragments can be chemically synthesized using techniques 
known in the art such as conventional Memfidd solid phase f-Moc or t-Boc chemistry. 
For example, an HDx polypeptide of the present invention may be ariiitrarily divided into 
fragments of desired length with no overiap of the fragments, or preferably divided into 
overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments 
which can function as either agonists or antagonists of a wUd-type (e.g.. "authentic") 
HDx protein. 

The recombinant HDx polypeptides of the present invention also include 
homologs of the authentic HDx proteins, such as versions of those protein which are 
resistant to proteolytic cleavage, as for example, due to mutations which alter 
ubiquitination or other enzymatic targeting associated with the protein. 

Modification of the stnicture of the subject HDx polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf 
life and resistance to proteolytic degradation in vivo), or post-translational modifications 
(e.g.. to alter phosphorylation pattern of protein). Such modified peptides, when 
designed to retain at least one activity of the naturally-occurring form of the prolein, or 
to produce specific antagonists thereof, are considered fiinctional equivalents of the HDx 
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polypeptides described in more detail herein. Such modified peptides can be produced, 
for instance, by amino acid substitution, deletion, or addition. 

For example, it is reasonable to expect that an isolated replacement of a leucine 
with an isoleucine or valine, an aspartate with a glutamate, a threoiune with a serine, or a 
5 similar replacement of an amino acid with a structurally related amino acid (ue, isosteric 
and/or isoelectric mutations) will not have a major effect on the biological activity of the 
resulting molecule. Conservative replacements are" those that take place within a family of 
amino acids that are related in their side chains. Genetically encoded amino acids are can 
be divided into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, 

10 arginine, histidine; (3) nonpolar = alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparagine, 
glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine 
are sometimes classified jointly as aromatic amino acids. In similar fashion, the amino 
acid repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine, 

15 arginine histidine, (3) aliphatic = glycine, alanine, valine, leucine, isoleudne, serine, 
threonine, with serine and threonine optionally be grouped separately as aliphatic- 
hydroxyl; (4) aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine, 
glutamine; and (6) sulfur -containing = cysteine and methionine, (see, for example. 
Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 1981). Whether a 

20 change in the amino acid sequence of a peptide results in a functional HDx homolog (e.g. 
functional in the sense that the resulting polypeptide mimics or antagonizes the wild-type 
form) can be readily determined by assessing the ability of the variant peptide to produce 
a response in cells in a fashion similar to the wild-type protein, or competitively inhibit 
such a response. Polypeptides in which more than one replacement has taken place can 

25 readily be tested in the same manner. 

This invention further contemplates a method for generating sets of combinatorial 
mutants of the subject HDx proteins as well as tmncation mutants, and is especially 
useful for identifying potential variant sequences (e.g. homologs) that are fimctional in 
modulatmg histone deacetylation. The purpose of screening such combinatorial libraries 

30 is to gmerate, for example, novel HDx homologs which can aa as either agonists or 
antagonist, or alternatively, possess novel activities all together. To illustrate, HDx 
homologs can be engineered by the present method to provide selective, constitutive 
activation of enzymatic activity. Thus, combinatorially-derived homologs can be 
generated to have an increased potency relative to a naturally occurring form of the 

35 protein. 
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Likewise, HDx homologs can be generated by the present combinatorial approach 
to selectively inhibit (antagonize) histone deacetylation. For instance, mutagenesis can 
provide HDx homologs which are able to bind other regulatory proteins or cytoskeletal 
elements (or DNA) yet prevent acetylation of histones, e.g. the homologs can be 
5 dominant negative mutants. In a preferred embodiment, a dominant negative mutant of 
an HDx protein is mutated at one or more residues of its catalytic site and/or specificity 
subsites. - - ... . ^...^i : . 

In one aspect of this method, the amino add sequences for a population of HDx 
homologs or other related proteins are aligned, preferably to promote the highest 

10 homology possible. Such a population of variants can include, for example, HDx 
homologs from one or more species. Amino acids which appear at each position of the 
aligned sequences are selected to create a degenerate set of combinatorial sequences. In 
a preferred embodiment, the variegated library of HDx variants is generated by 
combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene 

15 library. For instance, a mixture of synthetic oligonucleotides can be enzymatically ligated 
into gene sequences such that the degenerate set of potential HDx sequences are 
expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins 
(e.g. for phage display) containing the set of HDx sequences therein. 

As illustrated in Figure 5B, to analyze the sequences of a population of variants, 
20 the amino acid sequences of interest can be aligned relative, to sequence homology. The 
presence or absence of amino acids from an aligned sequence of a particular variant is 
relative to a chosen consensus length of a reference sequence, which can be real or 
artificial. For instance, Figure 5B includes the alignment of the v and x-motifs for several 
of the HDx gene products. Analysis of the alignment of these sequences from the HDx 
25 clones can give rise to the generation of a degenerate library of polypeptides comprising 
potential HDx sequences. In an exemplary embodiment, a library of variants based on 
thcHDJ sequence, but degenerate across each of the v and x-motifs can be provided. 
On such library can be represented by the general formula A-(v motif>B-(x motif)-C, 
wherein the v motif is an amino acid sequence represented in the general formula 

DIAXl^WAGGIJ^HAKKX2EASGFCYVNDIVX3X4I^^ 
7ro-RVKnVSF 

the X motif is an amino acid sequence represented in the general formula 
CVEX8VKX9FNXioPLLXi iLGGGGYTXijRNVARCWTYET 



30 
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A corresponds to Metl-Thrl29 of SEQ ID No. 5, B corresponds to Hisl99-Lys283 of 
SEQ ID No. 5, and C corresponds to Ala317-Ala482 of SEQ ID No. 5, wherein X| 
represents Ue or Val; represents Phe or Ser; X3 represents Phe or Leu; X4 represents 
Gly or Ala; X5 represents Pro or Gin; Xg represents Gin or Glu; X7 represents Leu or 
5 Thr; Xg represents Val or Tyr, X9 represents Thr or Ser; Xjo represents Leu or lie; Xj j 
represents Met or Val; and Xjj represents lie or Val. To further expand the 
combinatorial set, other consetyative mutations relative to those appearing in the human 
sequences can be provided. For example, in a more expansive library, Xi represents Gly, 
Ala, Val, lie or Leu; Xj represents Phe, Tyr, Thr or Ser; X3 represents Phe, Tyr, Gly, 

10 Ala, Val, He or Leu; X4 represents Gly, Ala, Val, lie or Leu; X5 represents Pro, Asn or 
Ghi; Xfi represents Asn, Gin, Asp or Glu; X7 represents Gly, Ala, Val, lie. Leu, Ser or 
Thr; Xg represents Gly, Ala, Val, lie. Leu, Phe or Tyr, X9 represents Thr, Cys, or Ser; 
Xio represents Gly, Ala, Val, lie or Leu; X^ represents Met, Cys, Gly, Ala, Val, lie, 
Leu, Ser or Thr; and X12 represents Gly, Ala, Val, De or Leu. In stiU another Ubrary, 

15 each degenerate position can be any one of the naturally occurring amino adds, 
likewise, the v and x-motifs can correspond to the degenerate sequences designated by 
SEQ ID Nos. 12 and 14, respectively. 

There are many ways by which such libraries of potential HDx homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 

20 degenerate gene sequence can be carried out m an automatic DNA synthesizer, and the 
^thetic genes then ligated into an appropriate expression vector. The purpose of a 
degenerate set of genes is to provide, in one mixture, all of the sequences encoding the 
desired set of potential HDx sequences. The synthesis of degenerate oligonucleotides is 
well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et 

25 al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Mao-opiolecules, ed. AG 
Walton, Amsterdam: Elsevier pp273.289; Itakura et al. (1984) Annu. Rev. Biochem. 
53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 
1 1:477. Such techniques have been employed in the directed evolution of other proteins 
(see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 

30 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwiria et al. (1990) PNAS 87: 
6378-6382; as weU as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

Likewise, a library of coding sequence fragments can be pro\dded for an HDx 
clone in order to generate a variegated population of HDx fragments for screening and 
subsequent selection of bioactive fragments. A variety of techniques are known in the art 
35 for generating such libraries, including chemical synthesis. In one embodiment, a library 
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of coding sequence fragments can be generated by (i) treating a double stranded PCR 
fragment of an HDx coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule; (ii) denaturing the double stranded DNA; (iii) 
renaturing the DNA to form double stranded DNA which can include sense/antisense 
5 pairs from different nicked products; fyi) removing single stranded portions from 
reformed duplexes by treatment with SI nuclease; and (v) ligating the resulting fragment 
library into an expressbn vector, ^ By this jcxemplaiy method, an expression library can be 
derived which codes for N-terminal, C-terminal and internal fi^igments of various sizes. 

A wide range of techniques are known in the art for screening gene produas of 

10 combinatorial libraries made by point mutations or taincation, and for screening cDNA 
libraries for gene products having a certain property. Such techniques will be generally 
adaptable for r^id screening of the gene libraries generated by the combinatorial 
mutagenesis HDx homologs. The most widely used techniques for screening large 
gene libraries typically comprises cloning the gene library into repUcaWe expression 

15 vectors, transforming appropriate cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates relatively easy isolation of the vector encoding the gene whose product 
was detected. 

In an exemplary embodimait, the library of HDx variants is expressed as a fusion 
20 protein on the surface of a viral particle. For instance, in the filamentous phage system, 
foreign peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to aflBnity 
matrices at very high concentrations, a large number of phage can be screened at one 
time. Second, since each infectious phage displays the combinatorial gene product on its 
25 surface, if a particular phage is recovered from an aflMty matrix in low jidd, the phage 
can be amplified by another round of infection. The group of ahnost identical E. coli 
filamentous phages M13, fd., and fl are most often used in phage display libraries, as 
either of the phage gHI or gVIH coat proteins can be used to generate fiision proteins 
without disrupting the ultimate packaging of the viral particle (Ladner et al. PCT 
30 pubUcation WO 90/02909; Garrard et al., PCT pubUcation WO 92/09690; Marks et al. 
(1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; 
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89-4457- 
4461). 

For example, the recombinant phage antibody system O^AS, Pharmacia Catalog 
35 number 27-9400-01) can be easily modified for use in expressing and screening HDx 
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combinatorial libraries by panning on glutathione immobilized histones/GST fusion 
proteins or RbAp48/GST fusion protein to enrich for HDx homoiogs which retain an 
ability to bind a substrate or regulatory protein. Each of these HDx homoiogs can 
subsequently be screened for further biological activities in order to differentiate agonists 
5 and antagonists. For example, histone-binding homoiogs isolated from the combinatorial 
library can be tested for their enzymatic activity directly, or for their effect on cellular 
proliferation relative to the wild-type form of the protein. 

The invention also provides for reduction of the HDx or RbAp48 or histones 
proteins to generate mimetics, e.g. peptide or non-peptide agents, which are able to 

10 disnipt a biological activity of an HDx polypeptide of the present invention, e.g. as 
catalytic inhibitor or an inhibitor of protein-protein interactions. Thus, such mutagenic 
techniques as described above are also useful to map the determinants of the HDx 
proteins vAiich participate in protein-protein or protein-DNA interactions involved in, for 
example, interaction of the subject HDx polypeptide with histones. RbAp48 or 

15 cytoskeletal elements. To illustrate, the critical residues of a subject HDx polypeptide 
>^ch are involved in molecular recognition of histones can be determined and used to 
generate //Z>r-derived peptidomimetics which competitively inhibit binding of the 
authentic HDx protein with that moiety. Likewise, residues of a histone or of RbAp48 
involved in binding to HDx proteins can be identified, and peptides or peptidomimetics 

20 based on such residues can also be used as competitive inhibitors of the interaction of an 
HDx protein with either of those proteins. By employing, for example, scanning 
mutagenesis to map the amino acid residues of a protein which is involved in binding 
other protdns, peptidomimetic compounds can be generated which mimic those residues 
which fiicilitate the interaction. Such mimetics may then be used to interfere with the 

25 normal fiinction of an HDx protein. For instance, non-hydrolyzable peptide analogs of 
such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see HuflSnan et al, in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma 

30 lactam rings (Garvey et al, in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netheriands, 1988), keto^methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure 
and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical 
Co. Rockland, IL, 1985), P-tum dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 

35 26:647; and Sato et al. {\9%6) J Chem Soc Perkin Trans 1:1231), and p-aminoalcohols 
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(Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Daim et al. (1986) 
Biochem Biophys Res Commun 134:71). 

Another aspect of the invention pertains to an antibody specifically reactive Avith 
an HDx protein. For example, by using immunogens derived fi'om an HDx protein, e.g. 
5 based on the cDNA sequences, anti-protein/antl-peptide antisera or monoclonal 
antibodies can be made by standard protocols (See, for example, Antibodies: A 
Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A 
mammal, such as a mouse, a hamster or rabbit can be immunized with an immunogenic 
form of the peptide (e.g., an HDx polypeptide or an antigenic fiagment which is capable 

10 of eliciting an antibody response). Techniques for conferring immunogenictty on a 
protein or peptide include conjugation to carriers or other techniques well known in the 
art. An immunogenic portion of an HDx protein can be administered in the presence of 
adjuvant. The progress of immunization can be monitored by detection of antibody titers 
in plasma or serum. Standard ELISA or other immunoassays can be used with the 

15 immunogen as antigen to assess the levels of antibodies. In a preferred embodiment, the 
subject antibodies are immunospecific for antigenic determinants of an HDx protein of a 
organism, such as a mammal, e.g. antigenic determinants of a protein represented by one 
of SEQ ID Nos: 5-8 or closely related homologs (e.g. at least 85% homologous, 
preferably at least 90% homologous, and more preferably at least 95% homologous). In 

20 yet a fiirther preferred embodiment of the present invention, in order to provide, for 
example, antibodies which are immuno-sdective for discrete HDx homologs, e.g. HDl, 
the anti-ffl)x polypeptide antibodies do not substantially cross react (i.e. does not react 
specifically) with a protein which is, for example, less than 85%, 90% or 95% 
homologous with the selected HDx, By "not substantially cross react", it is meant that 

25 the antibody has a binding affinity for a non-homologous protein which is at least one 
order of magnitude, more preferably at least 2 orders of magnitude, and even more 
preferably at least 3 orders of magnitude less than the binding affinity of the antibody for 
the intended target HDx, 

Following immunization of an animal with an antigenic preparation of an HDx 
30 polypeptide, anti-^Z>r antisera can be obtained and, if desired, polyclonal mx-HDx 
antibodies isolated fi-om the serum. To produce monoclonal antibodies, antibody- 
produdng cells (lymphocytes) can be harvested fi^om an immunized animal and fused by 
standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to 
yield hybridoma cells. Such techniques are well kno\yn in the art, an mclude, for 
35 example, the hybridoma technique (originally developed by Kohler and Milstein. (1975) 
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Nature, 256: 495-497), the human B cell hybridoma technique (Kozbar et al., (1983) 
Immunology Today, 4: 72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened inununochemically for 
5 production of antibodies specifically reactive with an HDx polypeptide of the present 
invention and monoclonal antibodies isolated from a culture comprising such hybridoma 
cells. 

The term antibody as used herein is intended to indude fragments thereof vrfiich 
are also specifically reactive with one of the subject HDx polypeptides. Antibodies can be 

10 fragmented using conventional techniques and the fingments screened for utility in the 
same manner as described above for u^ole antibodies. For example, F(ab)2 fragments 
can be generated by treating antibody with pepsin. The resulting F(ab)2 firagment can be 
treated to reduce disulfide bridges to produce Fab fragments. The antibody of the 
present invention is fiirther intended to include bispecific and chimeric molecules having 

15 aflfinity for an HDx protein conferred by at least one CDR region of the antibody. 

Both monoclonal and polyclonal antibodies (Ab) directed against authentic HDx 
polypeptides, or HDx variants, and antibody fragments such as Fab, F(ab)2, Fv and scFv 
can be used to block the action of one or more HDx proteins and allow the study of the 
role of these proteins in, for example, differentiation of tissue. Experiments of this nature 
20 can aid in deciphering the role of HDx proteins that may be involved in control of 
proliferation versus differentiation, e.g., in patterning and tissue formation. 

Antibodies which specifically bind HDx epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of each of the subject HDx polypeptides. Anti-Zffib: antibodies can 

25 be used diagnostically in immuno-precipitation and immuno-blotting to detect and 
evaluate HDx protein levels in tissue as part of a clinical testing procedure. For instance, 
such measurements can be useful in prediaive valuations of the onset or progression of 
proliferative or differentiative disorders. Likewise, the ability to monitor HDx protein 
levels in an individual can allow determination of the efficacy of a given treatment 

30 regimen for an individual afflicted with such a disorder. The level of HDx polypeptides 
may be measured from cells in bodily fluid, such as in samples of cerebral spinal fluid or 
amniotic fluid, or can be measured in tissue, such as produced by biopsy. Diagnostic 
assays using anti-ZfDx antibodies can include, for example, immunoassays designed to aid 
in early diagnosis of a disorder, particularly ones which are manifest at birth. Diagnostic 
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assays using anti-MJr polypeptide antibodies can also include immunoassays designed to 
aid in early diagnosis and phenotyping neoplastic or hyperplastic disorders. 

Another application of anti-MJr antibodies of the present invention is in the 
immunological screening of cDNA libraries constructed in expression vectors such as X 
5 gtll. ;^gtl8-23. XZAP, and X0RF8. Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
proteins. For instance. Xgtll will produce fusion proteins whose amino ttttmWiksC- 
of B-galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
pobrpeptide. Antigenic epitopes of an HDx protein, e.g. other orthologs of a particular 
HDx protem or other paralogs from the same species, can then be detected with 
antibodies, as. for example, reacting nitroceUulose filters lifted from infected plates with 
^u.HDx antibodies. Positive phage detected by this assay can then be isolated from the 
infected plate. Thus, the presence of HDx homologs can be detected and cloned from 
other animals, as can alternate isoforms Cmcluding splicing variants) from humans. 
15 Moreover, the nucleotide sequences determined from the cloning of HDx genes 

from organisms wiU further allow for the generation of probes and primers designed for 
use m Identifying and/or cloning HDx homologs in other ceU types. e.g from other 
tissues, as well as HDx homologs from other organisms. For instance, the present 
invenuon also provides a probe/primer comprising a substantially purified 
20 ohgonucleotide. which oligonucleotide comprises a region of nucleotide sequence that 
hybndizes under stringent conditions to at least 10 consecutive nucleotides of sense or 
anti-sense sequence selected from the group consisting of SEQ ID Nos: 1-4 or natuially 
o^g mutants thereof For instance, primers based on the nucleic acid represented in 
SEQ ID Nos: 1-4 can be used in PGR reactions to clone HDx homologs. Likewise 
25 probes based on the subject HDx sequences can be used to detect transcripts or genomic 
sequences encoding the same or homologous proteins. In preferred embodiments the 
probe further comprises a label group attached thereto and able to be detected e g the 
label group ,s selected from amongst radioisotopes, fluorescent compounds, enzymes 
and enzyme co-factors. 

«0 Such probes can also be used as a part of a diagnostic test Idt for identifying cells 

or tissue which misexpress an HDx protein, such as by measuring a level of an HDx- 
encodmg nucleic acid in a sample of cells fi.,m a patient; e.g. detecting HDx mRNA 
levels or determining whether a genomic HDx gene has been mutated or deleted 
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To illustrate, nucleotide probes can be generated from the subject HDx genes 
which facilitate histological screening of intact tissue and tissue samples for the presence 
(or absence) of /ff>x-encoding transcripts. Similar to the diagnostic uses of anti-jKDr 
antibodies, the use of probes directed to HDx messages, or to genomic HDx sequences, 
5 can be used for both predictive and therapeutic evaluation of allelic mutations which 
might be manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell 
.growth) or abnormal differentiation of tissue. Used in conjunction ^with immunoassays as 
described above, the oligonucleotide probes can help facilitate the determination of the 
molecular basis for a developmental disorder which may involve some abnormality 
10 associated with expression (or lack thereof) of an HDx protein. For instance, variation in 
polypeptide synthesis can be differentiated from a mutation in a coding sequence. 

Accordingly, the present method provides a method for determining if a subject is 
at risk for a disorder characterized by aberrant cell proliferation and/or differentiation. In 
preferred embodiments, method can be generally characterized as comprising detecting, 

15 in a sample of cells from the subject, the presence or absence of a genetic lesion 
characterized by at least one of (i) an alteration affecting the integrity of a gene encoding 
an /ffix-protein, or (ii) the mis-expression of the HDx gene. To illustrate, such genetic 
lesions can be detected by ascertaining the existence of at least one of (i) a deletion of 
one or more nucleotides from an HDx gene, (ii) an addition of one or more nucleotides to 

20 an HDx gene, (iii) a substitution of one or more nucleotides of an HDx gene, (iv) a gross 
chromosomal rearrangement of an HDx gene, (v) a gross alteration in the level of a 
messenger RNA transcript of an HDx gene, (vii) aberrant modification of an HDx gene, 
such as of the methylation pattern of the genomic DNA, (vii) the presence of a non-wild 
type splicing pattern of a messenger RNA transcript of an HDx gene, (viii) a non-wild 

25 type level of an /TDx-protein, and (be) inappropriate post-translational modification of an 
ffl5x-protein. As set out below, the present invention provides a large number of assay 
techniques for detecting lesions in an HDx gene, and importantly, provides the ability to 
discern between different molecular causes underlying ^£bc-dependent aberrant cell 
growth, proliferation and/or differentiation. 

30 In an exemplary embodiment, there is provided a nucleic acid composition 

comprising a (purified) oligonudeotide probe including a re^on of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence oiznHDx gene^ such as 
represented by any of SEQ ID Nos; 1-4, or naturally occurring mutants thereof^ or 5' or 
3' flanking sequences or intronic sequences naturally associated with the subject HDx 

35 genes or naturally pccurring mutants thereof The nucleic acid of a cell is rendered 
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accessible for hybridization, the probe is exposed to nucleic add of the sample, and the 
hybridization of the probe to the sample nucleic acid is detected. Such techniques can be 
used to detect lesions at either the genomic or mRNA level, including deletions, 
substitutions, etc., as well as to determine mRNA transcript levels. 

5 In certain embodiments, detection of the lesion comprises utilizing the 

probe/primer in a polymerase chain reaction (PGR) (see, e.g. U.S. Patent Nos. 4,683,195 
and 4,683,202), such as anchor PGR or RAGE PGR, or, alternatively, in a ligation chain 
reacuon (LCR) (see, e.g., Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al. (1944) PNAS 91:360-364), the later of which can be particulariy useful 

10 for detecting point mutations in the HDx gene. In a merely illustrative embodiment, the 
method includes the steps of 0) collecting a sample of cells from a patient, ^i) isolating 
nucleic acid (e.g., genomic, mRNA or both) from the ceUs of the sample, (iii) contacting 
the nucleic acid sample with one or more primers which ^jecifically hybridize to an HDx 
gene under conditions such that hybridization and amplification of the HDx gene (if 

15 present) occurs, and (iv) detecting the presence or absence of an amplification product, 
or detecting the size of the amplification product and comparing the length to a control 
sample. 

In still another embodiment, the level of an //Dx-protdn can be detected by 
immunoassay. For instance, the cells of a biopsy sample can be lysed, and the level of an 

20 fllbf-protein present in the cell can be quantiuted by standard immunoassay techniques. 
In yet another exemplary embodiment, aberrant methylation patterns of an HDx gene can 
be detected by digesting genomic DNA fiiom a patient sample with one or more 
restriction endonucleases that are sensitive to methylation and for which recognition sites 
exist in the HDx gene Cmcluding in the flanking and intronic sequences). See, for 

25 example. Bulling et al. (1994) Human Mol Genet 3:893-895. Digested DNA is separated 
by gel electrophoresis, and hybridized with probes derived from, for example, genomic or 
cDNA sequences. The methylation status of the HDx gene can be determined by 
comparison of the restriction pattern generated from the sample DNA vnth that for a 
standard of known methylation. 

30 In yet another aspect of the invention, the subject HDx polypeptides can be used 

to generate a "two hybrid" assay or an "interaction trap" assay (see, for example, U.S. 
Patent No. 5,283.317; Zervos et al. (1993) Gdl 72:223-232; Madura et al. (1993) J Biol 
Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. 
(1993) Oncogene 8:1693-1696; and Brent WO94/10300), for isolating coding sequences 

35 for other ceUular proteins which bind HDxs ("i7i>r-binding proteins" or "JXDr-bp"). 
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Such //Dx-binding proteins would likely be involved in the regulation ofHDx, e.g., as 
regulatory subunits or transducers, or be substrates which are regulated by an/ffibc. 

Briefly, the interaction trap relies on reconstituting in vivo a functional 
transcriptional activator protein from two separate fusion proteins. In particular, the 
5 method makes use of chimeric genes which express hybrid proteins. To illustrate, a first 
hybrid gene comprises the coduig sequence for a DNA-binding domain of a 
transcriptional activator fused in frame to the ooBmg sequence for an HDx polypeptide. 
The second hybrid protein encodes a transcriptional activation domain fused m frame to a 
sample gene from a cDNA library. If the bait and sample hybrid proteins are able to 
10 interact, e.g., form an jfiTftc-dependent complex, they bring into close proximity the two 
domains of the transcriptional activator. This proximity is suflSdent to cause 
transcription of a reporter gene which is opmbly linked to a transcriptional regulatory 
site responsive to the transcriptional activator, and expression of the reporter gene can be 
detected and used to score for the interaction of the HDx and sample protdns. 

15 Furthermore, by making available purified and recombinant HDx polypeptides, 

the present invention facilitates the development of assays which can be used to screen 
for dnigs, including HDx homologs, which are either agonists or antagonists of the 
normal cellular function of the subject HDx polypeptides, or of then- role in the 
pathogenesis of cellular differentiation and/or proliferation and disorders related thereto. 

20 Moreover, because we have also identified //Z>x-related proteins, such as the yeast RPD3 
protdns, as histone deacetyiases, the present invention further pro\ddes drug screening 
assays for detecting agents which modulate the bioactivity of fflJxr-rdated proteins. Such 
agents, when directed to, for example, fungal HDx-TtlBied proteins, can be used in the 
treatment of various infections. In a general sense, the assay evaluates the ability of a 

25 compound to modulate binding between an HDx polypeptide and a molecule, be it 
protein or DNA, that interacts with the HDx polypeptide. It will be apparent from the 
following description of exemplary assays that, in place of a human (or other mammalian) 
HDx protein, the assay can be derived with an //ZJr-related protein such as RPD3. 
Likewise, in place of human RbAp48 or Sin3A, other //Z>x-binding proteins can be used, 

30 e.g., other human proteins. Exemplary compounds which can be screened include 
peptides, nucleic acids, carbohydrates, small organic molecules, and natural product 
extract libraries, such as isolated from animals, plants, fungus and/or microbes. 

It is contemplated that any of the novel interactions described herein could be 
exploited in a drug screening assay. For example, in one embodiment, the interaction 
35 between an HDx protein and RbAp48 can be detected in the presence and the absence of 
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a test compound. In another embodiment, the ability of a compound to modulate the 
binding of an HDx protein, or //Dx-related protein such as the yeast RPD3, with histones 
can be assessed. The identification of a test compound which influences, for example, 
HDl catalyzed deacetylation of histones would be useful in the modulation of HDl 
5 activity in mammalian ceUs, while the idenUfication of a test compound which selectively 
inhibits the yeast RPD3 deacetylase activity would be useful as an antifungal agent. In 
other embodiments the eflfect of a test compoun(Un4he . binding of an HDx protein to 
other molecules, such as cytoskdetal components, or other proteins identified by the 
/ffir-dependent ITS set out above, could be tested. A variety of assay formats wiU 
10 suffice and, in light of the present inventions, wiU be comprehended by a skilled artisan. 

In a preferred embodiment, assays which employ the subject mammalian HDx 
proteins can be used to identify compounds that have therapeutic indexes more favorable 
than sodium butyrate, trapoxin, trichostatin or the like. For bstance. trapoxin-like drugs 
can be identified by the present invention which have enhanced tissue-type or cell-type 
specificity relative to trapoxin. To illustrate, the subject assays can be used to generate 
compounds which preferentially inhibit IL-2 mediated proliferation/activation of 
lymphocytes, or inhibit proliferation of certain tumor cells, without substantially 
interfering with other tissues, e.g. hepatocytes. Likewise, similar assays can be used to 
identify drugs which inhibit proliferation of yeast ceUs or other lower eukaiyotes, but 
which have a substantiaUy reduced effect on mammalian ceUs, thereby improving 
therapeutic index of the drug as an anti-mycotic agent. 

In one embodiment, the identification of such compounds is made possible by the 
use of differential screening assays which detect and compare drug-mediated inhibition of 
deacetylase activity between two or more different HDx-Yxk^ enzymes, or compare drug- 
mediated inhibition of formation of complexes invohnng two or more different types of 
ffiJr-Iike proteins. To illustrate, the assay can be designed for side-by-side comparison 
of the effect of a test compound on the deacetylase activity or protein interactions of 
tissue-type specific HDx proteins. Given the apparent diversity of HDx proteins, it is 
probable that different fijnctional HDx activities, or HDx complexes exist and. in certain 
instances, are localized to particular tissue or cell types. Thus, test compounds can be 
screened for agents able to inhibit the tissue-specific formation of only a subset of the 
possible repertoire of ^Z>r/regulatory protein complexes, or which preferentially inhibit 
certain HDx enzymes. In an exemplary embodiment, an imeraction trap assay can be 
derived using two or more different human HDx "bait" proteins, while the "fish" protein 
is constant in each, e.g. a human RbAp48 construct. Running the interaction trap side- 
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by-side permits the detection of agents which have a greater effect (e.g. statistically 
significant) on the formation of one of the ffl>x/RbAp48 complexes than on the formation 
of the other complexes. 

In similar fashion, differential screening assays can be used to exploit the 
5 difference in protein interactions and/or catalytic mechanism of mammalian HDx proteins 
and yeast RPD3 proteins in order to identify agents which display a statistically significant 
increase in specificity for inhibiting the yeast enzyme relative to the mammalian enzyme. 
Thus, lead compounds which act specifically on pathogens, such as fungus involved in 
mycotic infections, can be developed. By way of illustration, the present assays can be 
used to screen for agents which may ultimately be useful for inhibiting at least one fungus 
implicated in such mycosis as candidiasis, aspergillosis, mucormycosis, blastomycosis, 
geotrichosis, cryptococcosis, chromoblastomycosis, coccidioidomycosis, conidiosporosis, 
histoplasmosis, maduromycosis, rhinosporidosis, nocaidiosis, para-actinomycosis, 
penicilliosis, monoliasis, or sporotridiosis. For example, if the mycotic infection to which 
treatment is desired is candidiasis, the present assay can comprise comparing the relative 
effectiveness of a test compound on inhibiting the deacetylase activity of a mammalian 
HDx protein with its effectiveness towards inhibiting the deacetylase activity of an RPD3 
homolog cloned fi-om yeast seleaed fi-om the group consisting of Candida albicans, 
Candida siellatoidea, Candida tropicalis, Candida parapsilosis, Candida krusei, 
Candida pseudotropicalis, Candida quiliermondii, or Candida rugosa. Likewise, the 
present assay can be used to identify anti-fungal agents which may have therapeutic value 
in the treatment of asper^llosis by selectively targeting RPD3 homologs cloned firom 
yeast such as Aspergillus Jumigatus, Aspergillus flavus, Aspergillus niger, 
Aspergillus nidulans^ or Aspergillus terreus. Where the mycotic infection is 
mucormycosis, the JtPD3 deacetyhise can be derived fi-om yeast such as Rhizopus 
arrhizus, Rhizopus oryzae^ Absidia corymbifera, Absidia ramosa, or Mucor pusillus. 
Sources of other /?!PZ)3 activities for comparison with a mammalian HDx activity includes 
the pathogen Pneumocystis carinii. 

In addition to such HDx therapeutic uses, anti-fimgal agents developed with such 
differentia] screening assays can be used, for example, as preservatives in foodstuff, feed 
supplement for promoting weight gain in livestock, or in disinfectant formulations for 
treatment of non-living matter, e.g., for decontaminating hospital equipment and rooms. 

In similar fashion, side by side comparison of inhibition of a mammalian HDx 
proteins and an insect //Dx-related proteins, will permit selection of HDx inhibitors which 
discriminate between the human/mammalian and insect enzymes. Accordingly, the 
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present invention expressly contemplates the use and formulations of the subject HDx 
therapeutics in insecticides, such as for use in management of inserts like the fruit fly. 

In yet another embodiment, certain of the subject HDx inhibitors can be selerted 
on the basis of inhibitory specificity for plant /f£br-related activities relative to the 
5 mammalian enzyme. For example, a plant /ffibc-related protein can be disposed in a 
differential screen witii one or more of the human enzymes to sdert those compounds of 
greatest selectivity for inhibiting the plant enzyme. Thus, tiie present invention 
specifically contemplates formulations of the subjert HDx inhibitors for agricultural 
applications, such as in the form of a defoliant or the like. 

10 In many drug screening programs which test libraries of compounds and natural 

extrarts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 
systems, such as may be derived witii purified or semi-purified proteins, are often 
preferred as "primary" screens in that they can be generated to permit rapid development 
15 and relatively easy detection of an alteration in a molecular target which is mediated by a 
test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead being 
focused primarily on the eflfert of the drug on tiie molecular target as may be manifest in 
an alteration of binding affinity with upstream or downstream elements. Accordingly, in 
an exemplary saeening assay of the present invention, a reaction mixture is generated to 
include an /©X polypeptide, compound(s) of interest, and a "target polypeptide", e.g., a 
protein, which interacts witii the HDx polypeptide, whether as a substrate or by some 
otijer protein-protein interaction. Exemplary target polypeptides include histones, 
RbAp48 polypeptides, Sin3 polypeptides, and/or combinantions thereof or witii other 
transciptional regulatory proteins (such as myc, max. etc, see Example 3)). Detection 
and quantification of complexes containing tiie HDx protein provide a means for 
determining a compound's efficacy at inhibiting (or potentiating) complex formation 
between tiie HDx and tiie target polypeptide. The efiBcacy of the compound can be 
assessed by generating dose response curves from data obtained using various 
concenti-ations of tiie test compound. Moreover, a control assay can also be performed 
to provide a baseline for comparison. In tiie control assay, isolated and purified HDx 
polypeptide is added to a composition containing tiie target polypeptide and the 
formation of a complex is quantitated in tiie absence of the test compound. 

Complex formation between tiie HDx polypeptide and the target polypeptide may 
be detected by a variety of techniques. Modulation of tiie fomation of complexes can be 
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quantitated using, for example, detcctably labeled proteins such as radiolabeled, 
fluorescently labeled, or enzymatically labeled HDx polypeptides, by immunoassay, by 
chromatographic detection, or by detecting the intrinsic activity of the acetylase. 

Typically, it will be desirable to inunobilize either HDx or the target polypeptide 
5 to facilitate separation of complexes from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of HDx to the 
target polypeptide, in the presence and absence of a candidate agent, can be 
accomplished in any vessel suitable for contaming the reactants. Examples include 
microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 

10 protein can be provided which adds a domain that allows the prote'm to be bound to a 
matrix. For example, gIutathione-S-transferase/ffl>x (GST/HDx) fusion proteins can be 
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell lysates, 
e.g. an ^^S-labeled, and the test compound, and the mbcture incubated under conditions 

15 conducive to complex formation, e.g. at physiolo^cal conditions for salt and pH, though 
slightly more stringent conditions may be desired. Following incubation, the beads are 
washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly (e.g. beads placed in scintillant), or in the supernatant afler the 
complexes are subsequently dissociated. Alternatively, the complexes can be dissociated 

20 from the matrix, separated by SDS-PAGE, and the level of //Z>r-binding protein found in 
the bead fraction quantitated from the gel using standard electrophoretic techniques such 
as described in the appended examples. 

Other techniques for immobilizing proteins on matrices are also available for use 
in the subject assay. For instance, either HDx or target polypeptide can be immobilized 

25 utilizing conjugation of biotin and streptavidin. For instance, biotinylated HDx molecules 
can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known 
in the art (e.g., biotinylation kit. Pierce Chemicals, Rockford, IL), and inmiobilized in the 
wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies 
reactive with HDx, but which do not interfere with the interaction between the HDx and 

30 target polypeptide, can be derivatized to the wells of the plate, and HDx trapped in the 
wells by antibody conjugation. As above, preparations of an target polypqitide and a test 
compound are incubated in the M)x-presenting wells of the plate, and the amount of 
complex trapped in the well can be quantitated. Exemplary methods for detecting such 
complexes, in addition to those described above for the GST-unmobilized complexes, 

35 include immunodetection of complexes using antibodies reactive with the target 
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polypeptide, or which are reactive with HDx protein and compete with the target 
polypeptide; as well as enzyme-linked assays which rely on detecting an enzymatic 
activity associated with the target polypeptide, either intrinsic or extrinsic activity. In the 
instance of the latter, the enzyme can be chemically conjugated or provided as a fusion 
5 protein with the target polypeptide. To illustrate, the target polypeptide can be 
chemically cross-linked or genetically fused with horseradish peroxidase, and the amount 
of polypeptide trappaljnjhe complex can be assessed with a chromogenic substrate of 
the enzyme, e.g. 3,3'-diamino-benzadine terahydrochloride or 4-chloro-l-napthol. 
Likewise, a fusion protein comprising the polypeptide and glutathione-S-transferase can 
10 be provided, and complex formation quantitated by detecting the GST activity using 1- 
chloro-2,4-dinitrobenzene (Habig et al (1974) J Biol Chem 249:7130). 

For processes which rely on immunodetection for quantitating one of the proteins 
trapped in the complex, antibodies against the protein, such as anti-fffibc antibodies, can 
be used. Alternatively, the protein to be detected in the complex can be "epitope tagged" 

15 in the form of a fusion protein which includes, in addition to the HDx sequence, a second 
polypeptide for which antibodies are readily available (e.g. from commercial sources). 
For instance, the GST fusion proteins described above can also be used for quantification 
of binding using antibodies against the GST moiety. Other useful epitope tags include 
myc-epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21150-21157) which 

20 includes a 10-residue sequence from c-myc, as well as the pFLAG system (International 
Biotechnologies, Inc.) or the pEZZ-protein A system (Pharamacia, NJ). 

In another embodiment of a drug screening, a two hybrid assay can be generated 
with an HDx and i/£)x-binding protein. Drug dependent inhibition or potentiation of the 
interaction can be scored. 



25 



Where the HDx proteins themselves, or in complexes with other protons, are 
capable of binding DNA and modifying transcription of a gene, a transcriptional based 
assay uang, for example, an transcriptional regulatory sequences responsive to HDx 
complexes operably linked to a detectable marker gene. For iUustration, see Example 3. 

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and 
30 Mad(Pro)N35GALVPI6 mediated repression, we treated a duplicate set of transfections 
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment 
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16 nine- 
fold while it had little effect on the activity of Mad(Pro)N35GALVP16, suggesting that 
the histone deacetylation plays a direct role in mSin3A transcriptional repression (Figure 
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13B). In addition, there was typically less than a two-fold effect of trapoxin on the 
activity of the reporter cene in cells transfectcd vnih the expression vector alone or in 
cells transfectcd with GALVP16 (data not shown). Followng trs^oxin treatment, the 
repression observed for MadN35GALVPl6 was still seven times greater than that of 
5 Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following 
trapoxin treatment (Figure 13B) continues to drive mSin3A-mediated repression; 
howeverr we can not rule out that mSin3A is capable of repression by mechanisms 
independent of histone deacetylation. 

10 Furthermore, each of the assay systems set out above can be generated in a 

"differential" format as set forth above. That is, the assay format can provide information 
regarding spedficity as well as potency. For instance, side-by-side comparison of a test 
compound's eflfect on different HDxs can provide information on selectivity, and pennit 
the identification of compounds which selectively modulate the bioactivity of only a 

1 5 subset of the HDx family. 

Furthermore, inhibitors of the enzymatic activity of eadi of the subject HDx 
proteins can be identified using assays derived ft-om measuring the ability of an agent to 
inhibit catalytic conversion of a substrate by the subject proteins. For example, the ability 
of the subject HDx proteins to deacetylate a histone substrate, such as histone H4 (see 
20 examples), in the presence and absence of a candidate inhibitor, can be determined using 
standard enzymatic assays. 

A number of methods have been employed *m the art for assa^g histone 
deacetylase activity, and can be incorporated in the drug screening assays of the present 
invention. In preferred embodiments, the assay will employ a labded acetyl group linked 

25 to appropriate histone lysine residues as substrates. In other embodiments, a histone 
substrate peptide can be labeled with a group whose signal is dependent on the 
simultaneous presence or absence of an acetyl group, e.g., the label can be a fluorogenic 
group whose fluorescence is modulated (either quenched or potentiated) by the presence 
of the acetyl moiety. Using standard enzymatic analysis, the ability of a test agent to 

30 cause a statistically significant change in substrate conversion by a histone deacetylase 
can be measured, and as desirable, inhibition constants, e.g., Kj values, can be calculated. 
The histone substrate can be provided as a purified or semi-purified polypeptide or as 
part of a cell lysate. Likewise, the histone deacetylase can be provided to the reaction 
mbcture as a purified or semi-purified polypeptide or as a cell lysate. Accordingly, the 
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reaction mixtures of the subject method can range from reconstituted protein mixtures 
derived with purified preparations of histones and deacetylases, to mixtures of ceU 
lysates, e.g., by admixing baculovirus lysates containing recombinant histones and 
deacetylases. 

5 In an exemplary embodiment, the histone substrate for the subject assay is 

provided by isolation of radiolabeled histones from metaboUcally labeUed ceUs. To 
illustrate, as described by Hay et al. (1983) J Biol Chem 258:3726-3734, HeU ceUs can 
be labelled in culture by addition of pHjacetate (New England Nuclear) to the culture 
media. The addition of butyrate, trapoxin or the like can be used to increase the 

10 abundance of acetylated histones in the cells. Radiolabelled histones can be isolated from 
the cells by extraction with HsS04 (Marushige et al. (1966) J Mol Biol 15:160-174). 
Briefly, cells are homogenized in buffer, ccntrifiiged to isolate a nuclear peUet, the 
subsequently homogenized nuclear peUet centrifuged through sucrose, and the resulting 
chromatin peUet extracted by addition of HsS04 to yield piqacetyl-labeUed histones. In 

15 an alternate embodiment, nudeosome preparations containing pH]acelyl-labelled 
histones can be isolated from the labelled ceUs. As described in the art, nucleosomes can 
be isolated from cell preparations by sucrose gradient centrifiigation (Hay et al. (1983) J 
Biol Chem 258:3726-3734; and NoU (1967) Nature 215:360-363), and polynucleosomes 
can be prepared by NaCl precipitation from micrococcal nuclease digested cells (Hay et 
20 al., supra). Similar procedures for isolating labelled histones from other cells types, 
including yeast, have been described. See. for example, Alonso et al. (1986) Biochem 
Biophys Acta Z66:l6\-\69; and Kreiger et al. (1974) y.S/o/CAe»i 249:332-334. In yet 
other embodiments, the histone is generated by recombinant gene expression, and 
includes an exogenous tag (e.g., an HA epitope, a poly(his) sequence or the like) which 
25 faciUtates in purification from cdl extracts. In still other embodiments, whole nuclei can 
be isolated from metabolically labeUed cdls by micrococcal nudease digestion (Hay et al., 
supra) 

In stiU another embodiment, the deacctylase substrate can be provided as an 
acetylated peptide induding a sequence corresponding to the sequence about the specific 

30 lysyl residues acetylated on histone, e.g., a pepUdyl portions of the core histones H2A. 
H2B, H3 or H4. Such fragments can be produced by deavage of acetylated histones 
derived from metabolically labdled cells, e.g., such as by treatment with proteolytic 
enzymes or cyanogen bromide (Kreiger et al.. supra). In other embodiments, the 
acetylated peptide can be provided by standard solid phase synthesis using acetylated 

35 lysine residues (Kreiger et al., jM/?ra). 
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Continuing with the illustrative use of [3H]acetyl.labelled histones, the activity of 
a histone deacetylase in the subject assays is detected by measuring release of [3HIacetate 
by standard scintillant techniques. In a merely illustrative example, a reaction mixture is 
provided which comprises a recombinant HDx protein suspended in buffer, along with a 
5 sample of pHJacetyl-labelled histones and (optionally) a test compound. The reaction 
mixture is maintained at a desired temperature and pH, such as 22*C at pH7.8, for 
- - several hours, and the reaction terminated by boiling or other form of denaturation. 
Released [3H]acetate is extracted and counted. For example, the quenched reaction 
mixture can be acidified with concemrated HCl, and used to create a biphasic mixture 
10 with ethyl acetate. The resulting 2 phase system is thoroughly mixed, centrifoged. and 
the ethyl acetate phase collected and counted by standard scintillation methods. Other 
methods for detecting acetate release wiU be easily recognized by those skdUed in the art. 

In yet anoUier embodiment, the dnig screening assay is derived to include a whole 
ceU recombinantiy expressing one or more of a target protein or HDx protein. The ability 
15 of a test agent to alter the activity of the HDx protein can be detected by analysis of the 
recombinant cell. For example, agonists and antagonists of the HDx biological activity 
can by detected by scoring for alterations in growth or differentiation (phenotype) of tiie 
cell. General techniques for detecting each are well known, and will vary witii respect to 
the source of the particular reagent cell utilized in any given assay. 

20 For example, quantification of proliferation of cells in the presence and absence of 

a candidate agent can be measured mrh a number of techniques well known in the art. 
including simple measurement of population growtii curves. For instance, where the 
assay involves proliferation in a liquid medium, turbidimetric techniques (i.e. absorbence/ 
transmittance of light of a given wavelength through the sample) can be utilized. For 

25 example, in tiie instance where the reagent cell is a yeast cell, measurement of absoibence 
of light at a wavelength between 540 and 600nm can provide a convenicnUy fast measure 
of cell growtii. Likewise, ability to form colonies in solid medium (e.g. agar) can be used 
to readily score for proliferation. In other embodiments, an HDx substrate protein, such 
as a histone, can be provided as a fusion protein which permits tiie subsu^te to be 

30 isolated from ceU lysates and the degree of acetylation detected. Each of tiiese 
techniques are suitable for high through-put analysis necessary for rapid screening of 
large numbers of candidate agents. 

In addition, where the ability of an agent to cause or reverse a transformed 
phenotype, growth in solid media such as agar can fiirther aid in establishing whetiier a 
35 mammalian cell is transformed. 
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Additionally, visual inspection of the morphology of the reagent cdl can be used 
to determine whether the biological aaivity of the targeted HDx protein has been affected 
by the added agent. To illustrate, the ability of an agent to influence an apoptotic 
phenotype which is mediated in some way by a recombinant HDx protein can be assessed 
5 by visual microscopy. Likewise, the formation of certain cellular structures as part of 
dififerentiation, such as the formation of neuritic process, can be visualized under a light 
microscope. 

The nature of the effect of test agent on reagent cell can be assessed by measuring 
levels of expression of specific genes, e.g., by reverse transcription-PCR. Another 

10 method of scoring for eflfect on Hdx activity is by detecting ccU-type spedfic marker 
expression through immunofluorescent staining. Many such markers are knovwi in the 
art, and antibodies are readily available. For example, the presence of chondroitin 
sulphate proteoglycans as well as type-II collagen are correlated with cartilage 
production in chondrocytes, and each can be detected by immunostaining. Similariy, the 

15 human kidney differentiation antigen gpl60, human ammopeptidase A, is a marker of 
kidney induction, and the cytoskeletal protein troponin I is a marker of heart induction. 
In yet another embodiment, the alteration of expression of a reporter gene construct 
provided in the reagent cell provides a means of detecting the eflfect on HDx activity. For 
example, reporter gene constructs derived using the transcriptional regulatory sequences, 

20 e.g. the promoters, for developmcntally regulated genes can be used to drive the 
expression of a detectable marker, such as a luciferase gene. In an illustrative 
embodiment, the construct is derived using the promoter sequence from a gene expressed 
in a particular diSerentiative phenotype. 

It is also deemed to be within the scope of this invention that the recombinant 
IS HDx cells of the present assay can be generated so as to comprise heterologous HDx 
proteins (i.e. cross-species expression). For example, HDx proteins from one species can 
be expressed in the cells of another under conditions wherein the heterologous protein is 
able to rescue loss-of-fimction mutations in the host cell. For example, the reagent cell 
can be a yeast cell in which a human MDx protein (e.g. exogenously expressed) is the 
!0 intended target for development of an anti-proliferative agent. To illustrate, the M778 
strain, A£4 ra ura$-52 trpJAl his3-200 Ieu2-J trklA rpd3A::HIS3, described by Vidal et 
al. (1991) Mol Cell Biol 6317-6327, which lacks a functional endogenous RPD3 gene 
can be transfected witii an expression plasmid including a manrunalian HDx gene in order 
to complement the RPD3 loss-of-function. For example, the coding sequence for HDI 
5 can be cloned into a pRS integrative plasmid containing a selectable marker (Sikorski et 
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al. (1989) Genetics 122:19-27), and resulting construct used to transform the M778 
strain. The resulting cells should produce a mammalian HDJ protein which may be 
capable performing at least some of the functions of the yeast RPD3 protein. The HDx 
transformed yeast cells can be easier to manipulate than manmialian cells, and can provide 
S access to certain assay formats, such as turbidity detection methods, which may not be 
obtainable with mammalian cells. 

Moreover, the combination of the "mammalianized" strain with the strain M53 7 
{AfATa ura3'52 trplAl hi$3'200 leul-I trklA, Vidal ct al., supra) can provide an 
exquisitely sensitive cell-based assay for detecting agent which spedfically inhibit, for 
10 example, the yeast RPD3 deacetylase. 

In another aspect, the invention provides compounds useful for inhibition of 
HDxs, In a preferred embodiment, an HDx inhibitor compound of the invention can be 
represented by the formula A-B-C, in which A is a specificity element for selective 
binding to an HDx, B is a linker element, and C is an electrophilic moiety capable of 
15 reacting with a nucleophilic moiety of an HDx\ with the proviso that the compound is not 
butyrate, trapoxin, or trichostatin. 

In another aspect, the invention provides an aflSnity matrix for binding or 
purifying an HDx, In a preferred embodiment, the aflSnity matrix can be represented by 
the formula S-A-B-C, in which S is a solid or insoluble support, and A, B, and C are as 

20 described above. The solid or insoluble support S can be any of a variety of supports, 
many of which are known in the art, for synthesis of, or immobilization of, compounds, 
e.g., peptides, benzodiazepines, and the like. For a review of solid-supported synthesis, 
see, e.g., Hodge et al. Polymer-supported Reactions in Organic Synthesis, John Wiley 
& Sons, New York, 1980. The HDx inhibitor moiety A-B-C can be bonded directly to 

25 the support S, or can be bonded to the support S through a linking or spacing moiety, as 
is known in the art. 

In another aspect, the invention provides a method of inhibiting an HDx. The 
method comprises contacting the HDx with a compound capable of inhibiting HDx 
activity, under conditions such that HDx activity is inhibited. In preferred embodiments, 
30 the compounds can be represented by the formula A-B-C, in which A, B, and C are as 
described above; with the proviso that the compound is not butyrate, trapoxin, or 
trichostatin. 

In another aspect, the invention provides a method of purifying an HDx, The 
method includes contacting a reaction mixture comprising an HDx with an affinity matrix 
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capable of selectively binding to an HDx, and separating at least one other component of 
the reaction mixture from the HDx. In a preferred embodiment, the aflfinity matrix can be 
represented by the formula S-A-B-C, m which S, A, and C are as described above. 

In general, the elements A, B, and C of the inhibitor compounds are selected to 
5 permit selective binding to, and inhibition of, at least one HDx, The elements A, B, and 
C can be selected to provide specificity for particular HDxs. For example, a series of 
candidate HDx inhibitor compounds can be synthesized, e.g., according to the 
combinatorial methods described infra^ and the library of candidate compounds screened 
against one or more HDxs to determine the compound or compounds with optimal 
1 0 activity and specificity for a particular HDx. 

Thus, in preferred embodiments, the specificity element A is selected such that the 
HDx inhibitor compound bmds selectively to an HDx. In general, the specificity clement 
A will be selected according to factors such as the binding spedficity of the HDx or HDxs 
to wWch the inhibitor compound should bind, ease of synthesis, stability in vivo or in 
15 vitro, and the like. In certain embodiments, the specifidty element A is a 
cyclotetrapeptidyl moiety. In athother embodiment, A is a substituted or unsubstituted 
aryl moiety. In yet another embodiment, A is a nonaromatic carboqrcle. In still another 
embodiment, A is an amino acyl moiety (e.g., a natural or non-natural amino acyl 
moiety). In yet another embodiment, A is a heterocyclyl moiety. 

In preferred embodiments, B is selected from the group consisting of substituted 
and unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene, C4-C8 alkynylidene, and D-E-F, 
in which D and F are independently absent or C2-C7 alkylidene, C2-C7 alkenylidene, or 
C2-C7 alkynylidene, and E is O, S, or NR', in which R' is H, lower alkyl, lower alkenyl, 
lower alkynyl, aralkyl, aryl, or heterocyclyl. The element B should be selected to permit 
the specificity element A to interact with an HDx such that specific binding occurs, while 
poising the electrophilic moiety C for reaction with a nucleophilic moiety of the HDx. 

In a preferred embodiment, C is an electrophilic moiety that is approximately 
isosteric with an N-acetyl group (i.e., C has approximately the same steric bulk as an N- 
acetyl group)In preferred embodiments, the element C is capable of reacting, covalently 
or non-covalently, with a nucleophilic moiety of an HDx, In certain preferred 
embodiments, the element C is capable of binding (e.g., by chelation) to a metal ion, e.g., 
a divalent metal ion, e.g., zmc or calcium. In preferred embodiments, C is selected from 
the group consisting of a,3.epoxyketones, a,p-epoxythioketones, a,p-epoxysulfoxides, 
hydroxamic acids, a-haloketones, a-halothioketones, a-diazoketones, a- 
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diazothioketones, vinyl epoxides, trifluoromethylketone, trifluoromethylthioketone, 
enones (e.g., of ketones or thioketones), ynones (e.g., of ketones or thioketones), a,P- 
aziridinoketones, hydrazones, boronic adds, carboxylates, amides (e.g., -C(O)-amino), 
sulfones, aldehyde, alkyl halides, epoxides, and the like. 

5 In accordance with the foregoing, the moieties A, B, and C can illustratively be 

represented by the formulas depicted in Figure 6, in which Ri represents one or more 
substituents selected from the group consisting of amino, halogen, allg^l, alkenyl, alkynyl, 
aryl, aralkyl, heterocyciyl, azido, caAoxyl, alkoxycarbonyl, hydroxyl, alkoxy, cyano, 
trifluoromethyl, and the like; R" is CpCg alkylidene, C2-C8 alkenylidcne, or C2-Cg 

10 alkynylidene; R5 is hydrogen, alkyl, alkoxycarbonyl, aryloxycaibonyl, alkylsulfonyl, 
arylsulfonyl or aryl; % is hydrogen, alkyl, aryl, alkoxy, aryloxy, halogen, and the like; R'5 
is hydrogen, alkyl, alkenyl, alkynyl, aryl, and the like; R7 is hydrogen, alkyl, aiyl, alkoxy, 
aryloxy, amino, hydroxylamino, alkoxylamino, halogen, and the like; Rg is hydrogen, 
alkyl, halogen, and the like; R9 is hydrogen, alkyl, aryl, hydroxyl, alkoxy, aryloxy, amino, 

15 and the like; X is a good leaving group, c.g., diazo, halogen, a sulfate or sulfonate ester, 
e.g., a tosylate or mesylate, and the like; and Y is O or S. 

In certain preferred embodiments, an HDx inhibitor compound can be represented 
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls, 
unsubstituted and substituted aryls, heterocyclyls, amino acyls, and cyclotetrapeptides; B 

20 is selected from the group consisting of substituted and unsubstituted C4-Cg alkylidene, 
C4-Cg alkenylidene, C4-Cg alkynylidene, C4-Cg enyne, and D-E-F, in which D and F are 
independently absent or a C-C7 alkylidene, an C2-C7 alkenylidene, or an C2-C7 
alkynylidene, and E is 0, S, or MR', in which R represents H, a lower alkyl, a lower 
alkenyl, a lower alkynyl, an aralkyl, aryl, or a heterocyciyl; and C is selected from the 

25 group consisting of 



acid); in which Z represents O, S, or NR5, and Y, R5, R^, and R7 are as defined above. 
In preferred embodiments, R^ is hydrogen. In certain preferred embodimrats, B is not a 
C4-Cg alkylidene. In preferred embodiments, if B is a C4-Cg alkylidene, C is not a 
30 boronic acid. In other preferred embodiments, the inhibitor compound is not trapoxin. 

In certain preferred embodiments, an HDx inhibitor compound can be represented 
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls. 




, and B(OH)2 (boronic 
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unsubstituted and substituted aiyls, heterocyclyls, amino acyls, and cyclotetrapeptides; B 
IS selected from the group consisting of substituted and unsubstituted C4-C8 alkylidene, 
C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, and D-E-F, in which D and F are 
independently absent or CpCy alkylidene, C2-C7 alkenylidene. or C2-C7 alkynylidene, 
and E is 0, S, or MR', in which R' represents H, a lower alkyl, a lower alkenyl, a lower 
alkynyl, an aralkyl, an aryl, or a heterocyclyl; and C is selected from the group consisting 



of 



H H 



, in which R9 is as defined above. In preferred 
embodiments. B is not a C4.C8 alkylidene. In preferred embodiments, the inhibitor 
10 compound is not trichostatin. 

In still another preferred embodiment, an HDx inhibitor compound can be 
represented by the formula A-B-C, in which A is selected from the group consisting of 
cycloalkyls, unsubstituted and substituted aryls, heterocyclyls, amino acyls, and 
cyclotetrapeptides; B is selected from the group consisting of substituted and 
15 unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene. C4-C8 alkynyUdene. C4-C8 enyne. 
and D-E-F, in which D and F are independently absent or a C1-C7 alkylidene, a C2-C7' 
alkenylidene. or a C2-C7 alkynylidene, and E is 0. S. or MR', in which R' is H, lower 



alkyl. lower alkenyl, lower alkynyl. aralkyl, aryl, or heterocyclyl; and C is 
which Y is O or S, and R7 is as defined above. 




20 



Certain HDx inhibitor compounds of the present invention may exist in particular 
geometric or stereoisomeric forms. For example, amino acids can contain at least one 
chiral center. The present invention contemplates all such compounds, including cis- and 
trans-isomers, R- and S-enantiomers, diastereomers. the racemic mixtures thereof and 
other mbrtures thereof, as felling within the scope of the invention. Additional asymmetric 
25 carbon atoms may be present in a substituent such as an alkyl group. AU such isomers, as 
well as mbrtures thereof, are intended to be included in this invention. 

If, for instance, a particular enantiomer of a compound of tiie present invention is 
desired, it may be prepared by asymmetric synthesis, or by derivation with a chiral 
auxiliary, where the resulting diastereomeric mixture is separated and the auxiliary group 
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cleaved to provide the pure desired enantiomer. Alternatively, where the molecule 
contains a basic functional group, such as amino, or an acidic functional group, such as 
carboxyl, diastereomeric salts can be formed with an appropriate optically-active acid or 
base, followed by resolution of the diastereomers thus formed by fractional crystallization 
5 or chromatographic means well known in the art, and subsequent recovery of the pure 
enantiomers. 

The term "alkyl" refers to the radical ^oP&aturated aliphatic groups, including 
straight-chain alkyl groups, branched-chain alkyl groups, cydoalkyl (alicyclic) groups, 
alkyl substituted cydoalkyl groups, and cycloalkyl substituted alkyl groups. In preferred 
10 embodiments, a straight chain or branched chain alkyl has 30 or fewer carbon atoms m its 
backbone (e.g., C1-C30 for straight chain, C3-C30 for branched chain), and more 
preferably 20 or fewer. Likewise, preferred cycloalkyls have from 4-10 carbon atoms in 
their ring structure, and more preferably have 5, 6 or 7 carbons in the ring structure. 

Unless the number of caitons is otherwise specified, "lower alkyl" as used herein 
15 means an alkyl group, as defined above, but having from one to ten carbons, more 
preferably from one to sbc carbon atoms in its backbone structure. Likewise, "lower 
alkenyl" and "lower alkynyl" have similar chain lengths. Preferred alkyl groups are lower 
alkyls. In preferred embodiments, a substituent designated herein as alkyl is a lower 
alkyl. 

20 Moreover, the term "alkyl" (or "lower alkyl") as used throughout the specification 

and daims is intended to include both "unsubstituted alkyls" and "substituted alkyls". the 
latter of which refers to alkyl moieties having substituents replacing a hydrogen on one or 
more carbons of the hydrocarbon backbone. Such substituents can include, for example, 
halogen, hydroxyl, carbonyl (such as a carboxylate, alkoxycarbonyl, aryloxycarbonyl, 

25 alkylcarbonyl, arylcarbonyl, aldehyde, and the like), thiocarbonyl (such as a thioadd, 
alkoxycarbonyl, and the like), an alkoxyl, unsubstituted amino, mono- or disubstituted 
amino, amido, amidine, imine, nitro, arido, sulftiydryl, alkylthio, cyano, trifluoromethyl, 
sulfonato, sulfomoyl, sulfonamido, heterocyclyl, aralkyl, or an aromatic or heteroaromatic 
moiety. It will be understood by those skilled in the art that the moieties substituted on 

30 the hydrocarbon chain can themselves be substituted, as des^ibed above, if appropriate. 
Exemplary substituted alkyls are described below. Cydoalkyls can be further substituted 
with, e.g., alkyls, alkenyls, alkoxys, alkylthios, aminoalkyls, carbonyl-substituted alkj/Is, - 
CF3, -CN, and the like. 
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The terms "alkenyl" and "alkynyl" refer to unsaturated aliphatic groups analogous 
in length and possible substitution to the alkyls described above, but that contain at least 
one double or triple bond respectively. The term "enyne" refers to an unsaturated 
aliphatic moiety having at least one double bond and one triple bond. 

5 The terms "alkylidene," "alkenylidene," and "alkynylidene" are art-recognized and 

refer to moieties corresponding to alkyl, alkenyl, and alkynyl moieties as defined above, 
but having two valences available for bonding: ^ 

The term "aryl" as used herein includes 5-, 6- and 7-membered single-ring 
aromatic groups that may include from zero to four heteroatoms, for example, phenyl, 

10 pyrrolyl, furanyl, thiophenyl, imidazolyl, oxazolyl, thiazolyl, triazolyl, pyrazolyl, pyridyl, 
pyrazinyl, pyridazinyl and pyrimidyl, and the like. Those aryl groups having heteroatoms 
in the ring structure may also be referred to as "aryl heteroqrcles" or "heteroaromatics". 
The aromatic ring can be substituted at one or more ring positions with such substituents 
as described above, as for example, halogen, azido, alkyl, aralkyl, alkenyl, alkynyl, 

15 cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, carbonyl, carboxyl, silyl, 
ether, alkylthio, sulfonyl, sulfonamido, ketone, aldehyde, ester, a heterocyclyl, an 
aromatic or heteroaromatic moiety, -CF3, -CN, or the like. 

The term "aralkyl", as used herein, refers to an alkyl group substituted with an 
aryl group (e.g., an aromatic or heteroaromatic group). 

20 The terms "heterocyclyl" or "heterocyclic group" refer to non-aromatic 4- to 10- 

membered ring structures, more preferably 4- to 7-membered rings, which ring structures 
include one to four heteroatoms (e.g., O, N, S, P and the like). Heterocyclyl groups 
include, for example, pyrrolidine, oxolane, thiolane, imidazole, oxazole, piperidine, 
piperazine, morpholine, lactones, lactams such as azetidinones and pyrrolidinones, 

25 sultams, sultones, and the like. The heterocyclic ring can be substituted at one or more 
positions with such substituents as described above, as for example, halogen, alkyl, 
arallgrl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, 
alkoxycarbonyl, arylo?^cart>onyl, carboxyl, silyl, ether, alkylthio, alkylsulfonyl, 
arylsulfonyl, ketone (e.g., -C(0)-alkyl or -C(O)-aryl), aldehyde, heterocyclyl, an aryl or 

30 heteroaiyl moiety, -CF3, -CN, or the like. 

Compounds represented by the formula A-B-C, in which A, B. and C have the 
values described supra, can be synthesized by standard techniques of organic synthesis. 
For example, precursor synthons corresponding to each of the moieties A, B, and C, or 
subunits thereof, can be coupled in linear or convergent syntheses to provide HDx 
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inhibitor compounds, or compounds readily converted thereto. Syntheses of the HDx 
inhibitor compound trichostatin, and related compounds, have been reported; see, e.g., 
Massa, S. et al. (1990) J. Med Chem. 33:2845-49; Mori, K., and Kosecki, K. (1988) 
Tetrahedron 44:6013-20; Koseki, K. and Mori, K. European Patent Application EP 
5 331524 A2; Fleming, 1. etal. (1983) Tetrahedron 39:841-46. Analogs of trapoxin have 
also been synthesized; see, e.g., Yoshida, H, and Sugita, K. (1992) Jpn. J. Cancer Res, 
83:324-28. .^..^}-.^:\ 

Thus, in an illustrative synthesis, a compound represented by the formula A-B-C, 
in which A is an phenyl group, while B and C can have a variety of values, can be 
10 synthesized as shown below: 




3 

Scheme I 

According to the Scheme, a functionalized organometallic aryl compound 
15 (MX=organometallic moiety; R is any substituent; X is a leaving group, e.g., halogen) 
(e.g., organotin, boronate, aryllithium, cuprate, Grignard reagent, etc.) is alkylated or 
acylated to provide functionalized compounds (e.g.. the exemplary compounds L 2, or 1) 
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10 



15 



which can be further elaborated to provide compounds with a wide variety of substituents 
and carbon backbones. Other A moieties (c.g., specificity elements) can be obtained by 
use of appropriate synthons, e.g., by substituting vinylorganometallic compounds for the 
organometaUic aryl compound of the Scheme (followed by further treatment, e.g., 
reduction, of the vinyl group, if desired, to yield an alkyl A moiety). By way of 
illustration, as shown for compound I, the carbonyl group can be used for elaboration, 
e.g., by reduction of the carbbnyl group to an alcohol, conversion of the alcohol to a 
tosylate, and nucleophiUc displacement of the tosylate by an acyl compound (e.g., a 
ketone or ester) to provide a chain-lengthened product (Route A), which can be 
converted to a C(0)X functionaUty (e.g., by hydrolysis of an ester and conversion of the 
resulting carboxylic acid to an acid chloride). Alternatively, the carbonyl group of i can 
be used for olefination (Route B), e.g., Homer-Emmons olefination, to provide an 
elaborated alkenyl compound. Also, the carbonyl group can be converted to an alkynyl 
functionality, e.g., via the Corey-Fuchs procedure, to provide an elaborated alkynyl 
compound. For purposes of clarity, only certain chain lengths and functional group 
patterns are shown in the scheme; however, the skilled artisan will appreciate that many 
other compounds, with a variety of B moieties (i.e., linking moieties), can be synthesized 
through analogous procedures. The C(0)X functionality (e.g., an acid chloride where X 
IS CI) can be converted to functional groups such as amide, hydrazido, 
20 trifluormethylketone, enone, epoxide, aziridine. and the like, through methods 
conventional in the art. Thus, the synthetic pathways shown m the Scheme provide 
access to compounds having a variety of C moieties (e.g., reactive moieties) suitable for 
substitution in the subject HDx inhibitors. 

In vitro chemical synthesis provides a method for generating libraries of 
compounds that can be screened for ability to bind to or inhibit a target protein, e.g., an 
HDx. Although in vitro methods have previously been used in the pharmaceutical 
industry to identify potential dmgs, recently developed methods have focused on rapidly 
and efficiently generating and screening large numbers of compounds and are amenable to 
generating HDx inhibitor compound libraries for use in the subject method. The various 
approaches to simultaneous preparation and analysis of large number:} of compounds 
(herein "combinatorial synthesis") each rely on the fundamental concept of synthesis on a 
solid support introduced for peptides by Merrifield in 1963 (Merrifield, R.B. (1963) J Am 
Chem Soc 85:2149-2154). Many types of solid matrices have been successfully used in 
solid-phase synthesis, and can be selected according to the type of chemistry to be 
3 5 performed on the immobilized moieties, as is discussed in more detail below. 



25 



30 
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Several synthetic schemes have been suggested or employed for the combinatorial 
synthesis of organic compounds (see, e.g., E.M. Gordon ei ai, J, Med Chem, 37:1385- 
1401 (1994)). 

5 Muliipin Synthesis 

One method for combinatorial synthesis of compounds is the muitipin synthesis 
method. Briefly, Geysen and co-workers (Geysen et al. (1984) PNAS 81:3998-4002) 
introduced a method for generating compounds by a parallel synthesis on polyacrylic 
acid-grated polyethylene pins arrayed in the microtitre plate format. In the original 

10 experiments, about 50 nmol of a single compound was covalendy linked to the spherical 
head of each pin, and interactions of each compound with a receptor or antibody could be 
determined in a direct binding assay. The Geysen technique can be used to synthesize 
and screen thousands of compounds per week using the muitipin method, and the 
tethered compounds may be reused in many assays. In subsequent work, the level of 

15 compound loading on individual pins has been increased to as much as 2 ^mol/pin by 
grafting greater amounts of functionalized acrylate derivatives to detachable pin heads, 
and the size of the compound library has been increased (Valerio et al. (1993) Int J Pept 
Protein Res 42: 1-9). Appropriate linker moieties have also been appended to the pins so 
that the compounds may be cleaved from the supports after synthesis for assessment of 

20 purity and evaluation in competition binding or functional bioassays (Bray et ai. (1990) 
Tetrahedron Lett 31:5811-5814; Valerio et al. (1991) Anal Biochem 197:168-177; Bray 
et al. (1991) Tetrahedron Lett 32:6163-6166). 

More recent applications of the muitipin method have taken advantage of the 
deavable linker strategy to prepare soluble compound libraries (Maeji et al. (1990) J 
25 Immunol Methods 134:23-33; Gammon et al. (1991) J Exp Med 173:609-617; Mutch et 
al. (1991) Pep/ 4:132-137). 

Divide-Couple-Recombine 

In another embodiment, a variegated library of HDx inhibitor compounds is 
30 provided on a set of beads utilizing the strategy of divide-couple-recombtne (see, e.g., 
Houghten (1985) PNAS 82:5131-5135; and U.S. Patents 4,631,211; 5,440,016; 
5,480,971). Briefly^ as the name implies, at each synthesis step where degeneracy (e.g., a 
plurality of different moieties) is introduced into the library, the beads are divided into as 
many separate groups to correspond to the number of different residues (e.g., functional 
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groups or other moieties) to be added at that position, the diflferent residues coupled in 
separate reactions, and the beads recombined into one pool for the next step. 

In one embodiment, the divide-couple-recombine strategy can be carried out 
using the so-called "tea bag" method first developed by Houghten, where synthesis 
5 occurs on resin that is sealed inside porous polypropylene bags (Houghten et al. (1986) 
PNAS 82:5131-5135). Residues are coupled to the resins by piadng the bags in solutions 
of the appTdpfiate 'ihdividiial activated monomers, while all common steps such as resin 
washing and deprotection (if appropriate) are performed simultaneously in one reaction 
vessel. At the end of the synthesis, each bag contains a single compound, and the 

10 compounds may be liberated fi-om the resins using a multiple cleavage apparatus 
(Houghten et al. (1986) Int J Pept Protein Res 27:673-678). This technique oflfers 
advantages of considerable synthetic flexibility and has been partially automated (Beck- 
Sickinger et al. (1991) Pept Res 4:88-94). Moreover, compounds can be produced in 
sufficient quantities (> 500 ^miol) for purification and complete characterization if 

IS desired. 

Synthesis using the tea-bag approach is usefiil for the production of a library, 
albeit of limited size, as is illustrated by its use in a range of molecular recognition 
problems including antibody epitope analysis (Houghten et al. (1986) PNAS 82:5131- 
5135), peptide hormone structure-fimction studies (Beck-Sickinger et al. (1990) Int J 
20 Pept Protein Res 36:522-530; Beck-Sickinger et al. (1990) EurJBiochem 194:449-456), 
and protein conformational mapping (Zimmerman et al. (1991) EurJBiochem 200:519- 
528). 

25 Combinatorial Synthesis on Nontraditional Solid Supports 

The search for innovative methods of solid-phase synthesis has led to the 
investigation of alternative polymeric supports to the polystyrene-divinylbenzene matrix 
originally popularized by Merrifield. Cellulose, either in the form of paper disks 
(Blankemeyer-Menge et al. (1988) Tetrahedron Lett 29-5871-5874; Frank et al. (1988) 

30 Tetrahedron 44:603 1-6040; Eichler et al. (1989) Collect Czech Chem Commun 54:1746- 
1752; Frank, R. (1993) BioorgMed Chem Lett 3:425-430) or cotton fi-agments (Eichler 
et al. (1991) Pept Res 4:296-307; Schmidt et al. (1993) BioorgMed Chem Lett 3:441- 
446) has been successfijlly fiinaionalized for peptide synthesis. Typical loadings attained 
with cellulose paper range fi^om 1 to 3 jimol/cm^, and HPLC analysis of material cleaved 

35 fi-om these supports indicates a reasonable quality for the synthesized peptides. 
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Altematively, peptides may be synthesized on cellulose sheets via non-cleavable linkers 
and then used in ELISA-based binding studies (Frank, R. (1992) Tetrahedron 48:9217- 
9232). The porous, polar nature of this support may help suppress unwanted nonspecific 
protein binding effects. In one convenient configuration synthesis occurs in an 8 x 12 

5 microtiter plate format. Frank has used this technique to map the dominant epitopes of 
an antiserum raised against a human cytomegalovirus protein, following the overlapping 

^ peptide screening (Pepscan) strategy of Geysen (Frank, R. (1992) Tetrahedron 48:9217- 
9232). Other membrane-like supports that may be used for solid-phase synthesis include 
polystyrene-grafted polyethylene films (Berg et al. (1989) J Am Chem Sac 111:8024- 
10 8026). 

Combinatorial Libraries by Light-Directed. Spatially Addressable Parallel Chemical 
Synthesis 

A scheme of combinatorial synthesis in which the identity of a compound is given 
IS by its locations on a synthesis substrate is termed a spatially-addressable synthesis. In one 
embodiment, the combinatorial process is carried out by controlling the addition of a 
chemical reagent to specific locations on a solid support (Dower et al. (1991) Annu Rep 
Med Chem 26:271-280; Fodor, S.P.A. (1991) Science ISUST, Puiung et al. (1992) 
U.S. Patent No, 5,143,854; Jacobs et al. (1994) Trends Biotechnol 12:19-26). The 
20 technique combines two well-developed technologies: solid-phase synthesis chemistry 
and photolithography. The high coupling yields of solid-phase reactions allows efficient 
compound synthesis, and the spatial resolution of photolithography afifords 
miniaturization. The merging of these two technologies is done through the use of 
photolabile protecting groups, e.g., amino protecting groups, in the synthetic procedure. 

25 The key points of this technology are illustrated in Gallop et al. (1994) J Med 

Chem 37:1233-1251. A synthesis substrate is prepared for compound synthesis through 
the covalent attachment of photolabile nitroveratryloxycarbonyl (NVOC) protected 
amino linkers. Light is used to selectively activate a specified region of the synthesis 
support for coupling. Removal of the photolabile protecting groups by lights 

30 (deprotection) results in activation of selected areas. After activation, the first of a set of 
re^dues, each bearing a photolabile protecting group, is exposed to the entire surface. 
Coupling only occurs in regions that were addressed by light in the preceding step. The 
reagent solution is removed, and the substrate is again illuminated through a second 
mask, activating a different region for reaction with a second protected building block. 

35 The pattern of masks and the sequence of reactants define the products and their 
locations. Since this process utilizes photolithography techniques, the number of 
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compounds that can be synthesized is limited only by the number of synthesis sites that 
can be addressed with appropriate resolution. The position of each compound is 
precisely known; hence, its interactions with other molecules can be directly assessed. 
The target can be labeled with a fluorescent reporter group to facilitate the identification 
5 of specific interactions with individual members of the matrix. 

In a light-directed chemical synthesis, the products depend on the pattern of 
illumination and on the order of addition of reactants. By varying the lithographic 
patterns, many different sets of test compounds can be synthesized in the same number of 
steps; this leads to the generation of many different masking strategies. 

10 

Encoded Combinatorial Ubraries 

In yet another embodiment, the subject method provides an HDx inhibitor 
compound library provided with an encoded tagging system. A recent improvement in 
the identification of active compounds fi-om combinatorial libraries employs chemical 

15 indexing systems using tags that uniquely encode the reaction steps a given bead has 
undergone and, by inference, the struUure it carries. Conceptually, this approach mimics 
phage display libraries, where activity derives firom expressed peptides, but the structures 
of the active peptides are deduced fi-om the corresponding genomic DNA sequence. The 
first encoding of synthetic combinatorial libraries employed DNA as the code. Two forms 

20 of encoding have been reported: encoding with sequenceable bio-oligomers (e.g., 
oligonucleotides and peptides), and binary encoding with non-sequenceable tags. 

Tagging with sequenceable bio^ligomers 

The principle of using oligonucleotides to encode combinatorial synthetic libraries 
25 was described in 1992 (Brenner et al. (1992) PNAS 89:5381-5383), and an example of 
such a library appeared the following year (Needles et al. (1993) PNAS 90: 10700-10704). 
A combinatorial library of nominally 7^ (= 823,543) peptides composed of all 
combinations of Arg, Gin, Phe, Lys, Val, D-Val and Thr (three-letter amino acid code), 
each of which was encoded by a specific dinucleotide (TA, TC, CT, AT, TT, CA and 
30 AC, respectively), was prepared by a series of alternating rounds of peptide and 
oligonucleotide synthesis on solid support. In this work, the amine linking functionality 
on the bead was specifically differentiated toward peptide or oligonucleotide synthesis by 
simultaneously preincubating the beads with reagents that generate protected OH groups 
for oligonucleotide synthesis and protected NH2 groups for peptide synthesis (here, in a 
35 ratio of 1:20). When complete, the tags each consisted of 69-mers, 14 units of which 
earned the code. The bead-bound library was incubated with a fluorescently labeled 
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antibody, and beads containing bound antibody that fluoresced strongly were harvested 
by fluorescence-activated cell sorting (FACS). The DNA tags were amplified by PGR and 
sequenced, and the predicted peptides were synthesized. Following such techniques, 
HDx inhibitor compound libraries can be derived and screened using HDxs of the subject 
5 invention. 

It is noted that an alternative approach useful for generating nucleotide-encoded 
synthetic peptide libraries employs a branched linker containing selectively protected OH 
and NH2 groups (Nielsen et al. {1992) J Am Chem Soc 115:9812-9813; and Nielsen et al. 
(1994) Methods Compm Methods Enzymol 6:361-371). This approach requires that 
10 equimolar quantities of test peptide and tag co-exist, though this may be a potential 
complication in assessing biological activity, especially with nucleic acid based targets. 

The use of oligonucleotide tags permits exquisitely sensitive tag analysis. Even so, 
the method requires careful choice of orthogonal sets of protecting groups required for 
alternating co-synthesis of the tag and the library member. Furthermore, the chemical 

15 lability of the tag, particularly the phosphate and sugar anomeric linkages, may limit the 
choice of reagents and conditions that can be employed for the synthesis on non- 
oligomeric libraries. In preferred embodiments, the libraries employ linkers permitting 
selective detachment of the test HDx inhibitor compound library member for bioassay, in 
part (as described infra) because assays employing beads limit the choice of targets, and 

20 in part because the tags are potentially susceptible to biodegradation. 

Peptides themselves have been employed as tagging molecules for combinatorial 
libraries. Two exemplary approaches are described in the art, both of which employ 
branched linkers to solid phase upon which coding and ligand strands are alternately 
elaborated. In the first approach (Kerr JM et al. (1993) J Am ChemSoc 115:2529-253 1), 
25 orthogonality in synthesis is achieved by employing acid-labile protection for the coding 
strand and base-labile protection for the ligand strand. 

In an alternative approach (Nikolaiev et al. (1993) PeptRes 6:161-170), branched 
linkers are employed so that the coding unit and the test peptide are both attached to the 
same functional group on the resin. In one embodiment, a linker can be placed between 
30 the branch point and the bead so that cleavage releases a molecule containing both code 
and ligand (Ptek et al. (1991) Tetrahedron Lett 32:3891-3894). In another embodiment, 
the linker can be placed so that the test peptide can be selectively separated from the 
bead, leaving the code behind. This last construct is particularly valuable because it 
pennits screening of the test peptide without potential interference, or biodegradation, of 
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the coding groups. Examples in the art of independent cleavage and sequencing of 
peptide library members and their corresponding tags has confirmed that the tags can 
accurately predict the peptide structure. 

It is noted that peptide tags are more resistant to decomposition during ligand 
5 synthesis than are oligonucleotide tags, but they must be employed in molar ratios nearly 
equal to those of the ligand on typical 130 ^m beads in order to be successfully 
sequenced. As with oligonucleotide encoding, the use of peptides as tags requires 
complex protection/deprotection chemistries. 

10 Non-sequenceable tagging: binary encoding 

An alternative form of encoding the test peptide library employs a set of non- 
sequenceable tagging molecules (e.g., molecules having electrophone moieties) that are 
used as a binary code (Ohlmeyer et al. (1993) PNAS 90:10922-10926). Exemphiry tags 
are haloaromatic alkyl ethers that are detectable as their trimethylsifyl ethws at less than 

15 femtomolar levels by electron capture gas chromatography (ECGC). Variations in the 
length of the alkyl chain, as jvell as the nature and position of the aromatic halide 
substituents, permit the synthesis of at least 40 such tags, which in principle can encode 
2'W (e.g., upwards of 10l2) different molecules. In the original report (Ohlmeyer et al., 
supra) the tags were bound to about 1% of the available amine groups of a peptide 

20 library via a photocleavable 0-nitrobenzyl linker. This approach is convenient when 
preparing combinatorial libraries of peptides or other amine-containing molecules. A 
more versatile system has, however, been developed that permits encoding of essentially 
any combinatorial library. Here, the ligand is attached to the solid support via the 
photocleavable linker and the tag is attached through a catechol ether linker via carbene 

25 insertion into the bead matrix (Nestler et al. (1994) J Org Chem 59:4723-4724). This 
orthogonal attachment strategy permits the selective detochment of hT)rary members for 
bioassay in solution and subsequent decoding by ECGC after oxidative detachment of the 
tag sets. 

Binary encoding with tags, e.g., electrophoric tags, has been particularly usefiil in 
defining selective interactions of substrates with synthetic recq)tore (Borchardt et al. 
(1994) J Am Chem Soc 116:373-374), and model systems for understanding the binding 
and catalysis of biomolecules. Even using detailed molecular modeling, the identification 
of the selectivity preferences for synthetic receptors has required the manual synthesis of 
dozens of potential substrates. The use of encoded libraries makes it possible to rapidly 
examine all the members of a potential binding set. The use of binaiy-encoded libraries 
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has made the determination of binding selectivities so facile that structural selectivity has 
been reported for four novel synthetic macrobicyclic and tricyclic receptors in a single 
conmiunication (Wennemers et al. (1995) J Org Chem 60:1108-1109; and Yoon et al. 

(1994) Tetrahedron Lett 35:8557-8560) using the encoded libraiy mentioned above. 
5 Similar facility in defining specificity of interaction would be expected for many other 

biomolecules. 

Although the several amide-linked libraries in the ait employ binary encoding with 
the electrophoric tags attached to amine groups, attaching these tags direcdy to the bead 
matrix provides far greater versatility in the structures that can be prepared in encoded 
10 combinatorial libraries. Attached in this way, the tags and their linker are neariy as 
uru'cactive as the bead matrix itself Two binary-encoded combinatorial libraries have 
been reported where the tags are attached directly to the solid phase (Ohlmeyer et al. 

(1995) PNAS 92:6027-6031) and provide guidance for generating the subject HDx 
inhibitor compound library. Both libraries were constructed using an orthogonal 

IS attachment strategy in which the library member was linked to the solid support by a 
photolabile linker and the tags were attached through a linker cleavable only by vigorous 
oxidation. Because the library members can be repetitively partially photoeluted fi'om the 
solid support, library members can be utilized in multiple assays. Successive photoelution 
also permits a very high throughput iterative screening strategy: first, multiple beads are 

20 placed in 96-weII microtiter plates; second, ligands are partially detadied and transferred 
to assay plates; third, a bioassay identifies the active wells; fourth, the corresponding 
beads are rearrayed singly into new microtiter plates; fifth, single active compounds are 
identified; and sbcth, the structures are decoded. 

The above approach was employed in screening for carbonic anhydrase (CA) 
25 binding and identified compounds which exhibited nanomolar affinities for CA. Unlike 
sequenceable tagging, a large number of structures can be rapidly decoded fi-om binary- 
encoded libraries (a single ECGC apparatus can decode 50 structures per day). Thus, 
binary-encoded libraries can be used for the rapid analysis of structure-activity 
relationships and optimization of both potency and selectivity of an active series. The 
30 synthesis and screening of large unbiased binary encoded HE>x inhibitor compound 
libraries for lead identification, followed by preparation and analysis of smaller focused 
libraries for lead optimization, offers a particularly powerful approach to discovery of 
HDx inhibitor compounds. 

HDx inhibitor compounds can be synthesized on solid support by appropriate 
35 fiinctionaUsiation for attachment to a solid matrix, or alternatively, by solution-phase 
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synthesis followed by immobilization through an appropriate functional group. Thus, in 
an illustrative embodiment, an HDx inhibitor compound, which is analogous to 
trichostatin, can be synthesized on a solid support by attachment through an amino group 
of the specificity element A, as shown in Figure 7. The solid support is preferably 
5 capable of withstanding synthetic conditions required to syntheaze the requisite 
compounds. The compound can preferably be released firom the solid support, e.g., by 
selective cleavage of an amide bond. : „ 

The synthetic steps employed to synthesize compounds on solid support are 
preferably selected to allow a wide variety of residues (e.g., building blocks) to be 
10 coupled to the immobilized moieties, preferably under mild conditions. Suitable reaction 
chemistries include well-known carbon-caibon bond forming reactions such as the Stille 
and Suzuki couplings, as well as Homer-Emmons reactions, Ni/Cr mediated couplings, 
and the like. Particularly preferred coupling reactions can be performed in the presence 
of water and do not require harsh conditions or expensive reagents. 

15 Thus, in an exemplary synthesis shown in Figure 7, substituted N-methyl-4- 

(tributyltin)anilines (in which represents one or more substitutions, e.g., hydrogen, 
halogen, alkyl, alkoxy, and the like) are coupled in a plurality of reaction vessels to beads 
of a solid support (e.g., AfBgel). The beads are further divided into a pluraHty of reaction 
vessels, and suspended in a solvent such as DMF, and one acid chloride building block 

20 (corresponding to linking element B) is introduced into each vessel (R2 and R3 represent, 
e.g., hydrogen, halogen, alkyl, and the like; and the broken line represents an optional 
double bond). The reactions are stirred under an inert gas (e.g. nitrogen) and a palladium 
catalyst (e.g., is added (0.1-1.0 mol%). The reaction is stirred for 1-24 

hours. Upon completion of the reaction, the beads are washed, and placed in a plurality 

25 of vessels. The aldehyde moiety is deprotected by mild acid treatment (e.g., PPTS in 
MeOH), and the beads are again washed and placed in a plurality of reaction vessels, and 
the beads are suspended in dry acetonitrile. One building block (corresponding to the 
reactive element C) is then added to each reaction vessel. As iUustratively shown in 
Figure 7, a plurality of phosphonates can be employed (R4 represents, e.g., all^l, alkenyl, 

30 alkynyl, alkoxy, and the like). A Homer-Emmons reaction is performed by addition of 
LiCl (1.1 equiv.) and diisopropylethylamine (DIPEA) or DBU (1.2 equiv). Upon 
completion of the reaction, the beads are washed with water and acetonitrile, and then 
dried to yield a library of candidate HDx inhibitor compounds on solid support. The 
compounds can then be released from the solid support into solution; or the compounds 

35 can be screened while attached to the solid support. 
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The above combinatorial synthesis can be performed in an encoded mode, e.g., 
the binary tagging method described supra, by addition of the appropriate tag for each 
monomer. In this mode, after each reaction has been performed and the corresponding 
tag attached, the beads from all reactions can be recombined and then divided into 
5 aliquots for further derivatization. This method provides the advantage of ease of 
handling when large libraries are to be synthesized. Regardless of the method of 
synthesis, the combinatorial library, can be screened-^ for activity according to known 
methods (see, e.g., Gordon et aL, supra), 

10 In another aspect, the present invention provides pharmaceutically acceptable 

compositions which comprise a therapeutically-effective amount of one or more of the 
compounds described above, formulated together with one or more pharmaceuticaUy 
acceptable carriers (additives) and/or diluents. As described in detail below, the 
pharmaceutical compositions of the present mvention may be specially formulated for 

15 administration in solid or liquid form, including those adapted for the folIo\snng: (1) oral 
administration, for example^ drenches (aqueous or non-aqueous solutions or 
suspensions), tablets, boluses, powders, granules, pastes for application to the tongue; (2) 
parenteral administration, for example, by subcutaneous, intramuscular or intravenous 
injection as, for example, a sterile solution or suspension; (3) topical application, for 

20 example, as a cream, ointment or spray applied to the skin; or (4) intravaginally or 
intrarectally, for example, as a pessary, cream or foam. 

The phrase "therapeutically-effective amount" as used herein means that amount 
of a compound, material, or composition comprising a deacetylase inhibitor of the present 
invention which is effective for producing some desired therapeutic efifect by inhibiting 
25 histone deacetylation in at least a sub-population of cells in an animal and thereby 
blocking the biological consequences of that event in the treated cells, at a reasonable 
benefit/risk ratio applicable to any medical treatment. 

The phrase "pharmaceutically acceptable" is employed herein to refer to those 
compounds, materials, compositions, and/or dosage forms which are, within the scope of 
30 sound medical judgment, suitable for use in contact v^th the tissues of human beings and 
animals without excessive toxicity, irritation, aUergic response, or other problem or 
complication, commensurate with a reasonable benefit/risk ratio. 

The phrase "pharmaceutically-acceptable carrier" as used herein means a 
pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid 
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fiiler, diluent, cxcipient, solvent or encapsulating material, involved in carrying or 
transporting the subject deacetylase inhibitor agent from one organ, or portion of the 
body, to another organ, or portion of the body. Each carrier must be "acceptable" in the 
sense of being compatible with the other ingredients of the formulation and not injurious 
5 to the patient. Some examples of materials which can serve as pharmaceutically- 
acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, 
such as com starch and potato starch; (3) cellulose-; and its. derivatives, such as sodium 
cari)oxymethyI cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth; 
(5) malt; (6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes; 

1 0 (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oD, olive oil, com oU and 
soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, 
sorbitol, mannitol and polyethylene glycol; (12) esters, such as ethyl oleate and ethyl 
laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum 
hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's 

15 solution; (19) ethyl alcohol; (20) phosphate buffer solutions; and (21) other non-toxic 
compatible substances employed in pharmaceutical formulations. 

As set out above, certain embodiments of the present deacet)dase inhibitors may 
contain a basic fimctional group, such as ammo or alkylamino, and are, thus, capable of 
fonning pharmaceutically-acceptable salts with pharmaceutically-acceptable adds. The 
term "pharmaceutically-acceptable salts" in this respect, refers to the relatively non-toxic, 
inorganic and organic acid addition salts of compounds of the present invention. These 
salts can be prepared in situ during the final isolation and purification of the compounds 
of the invention, or by separately reacting a purified compound of the invention in its free 
base form with a suitable organic or inorganic acid, and isolating the salt thus formed. 
25 Representative salts include the hydrobromide, hydrochloride, sulfate, bisulfate, 
phosphate, nitrate, acetate, valerate, oleate, palmitate, stearate, laurate. benzoate, lactate, 
phosphate, tosylate, citrate, maleate. fiimarate, succinate, tartrate, napthylate, mesylate, 
glucoheptonate, lactobionate, and laurylsulphonate salts and the like. (See, for estample, 
Berge et al. (1977) "Pharmaceutical Salts", J. Pharm. Sci. 66: 1-19) 

In other cases, the deacetylase inhibitory compounds of the present invention may 
contain one or more acidic functional groups and. thus, are capable of forming 
pharmaceutically-acceptable salts with pharmaceutically-acceptable bases. The term 
"pharmaceutically-acceptable salts" in these instances refers to the relatively non-toxic, 
inorganic and organic base addition salts of compounds of the present invention. These 
salts can likewise be prepared in situ during the final isolation and purification of the 
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cx)mpounds, or by separately reacting the purified compound in its free acid form with a 
suitable base, such as the hydroxide, carbonate or bicarf)onate of a pharmaceutically- 
acceptable metal cation, with anunonia, or with a pharmaceutically-acceptable organic 
primary, secondary or tertiary amine. Representative alkali or alkaline earth salts include 
5 the lithium, sodium, potassium, calcium, magnesium, and aluminum salts and the like. 
Representative organic amines useful for the formation of base addition salts include 
ethylaniine,.diethylamine,,ethyleriedianiine, ethanolaniine, diethanolamine, piperazine and 
the like. (See, for example, Berge et al., supra) 

Wetting agents, emulsifiers and lubricants, such as sodium lauryl sulfate and 
10 magnesium stearate, as well as coloring agents, release agents, coating agents, 
sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be 
present in the compositions. 

Examples of pharmaceutically-acceptable antioxidants include: (1) water soluble 
antioxidants, such as ascorbic acid, cysteine hydrochloride, sodium bisul&te, sodium 
15 metabisulfite, sodium sulfite and the like; (2) oil-soluble antioxidants, such as ascori)jd 
palmitate, butylated hydroxyanisole (BHA), butylated hydroxytoluene (BHT), lecithin, 
propyl gallate, alpha-tocopherol, and the like; and (3) metal chelating agents, such as 
citric acid, ethylenediamine tetraacetic acid (EDTA), sorbitol, tartaric acid, phosphoric 
acid, and the like. 

20 Formulations of the present invention include those suitable for oral, nasal, topical 

(including buccal and sublingual), rectal, vaginal and/or parenteral administration. The 
formulations may conveniently be presented in unit dosage form and may be prepared by 
any methods well known in the art of pharmacy. The amount of active ingredient which 
can be combined with a carrier material to produce a single dosage form will vary 

25 depending upon the host being treated, the particular mode of administration. The 
amount of active ingredient which can be combined with a carrier material to produce a 
single dosage form will generally be that amount of the deacetylase inhibitor which 
produces a therapeutic effect. Generally, out of one hundred per cent, this amount will 
range from about 1 per cent to about ninety-nine percent of active ingredient, preferably 

30 fix)m about 5 per cent to about 70 per cent, most preferably from about 10 per cent to 
about 30 per cent. 

Methods of preparing these formulations or compositions include the step of 
bringing into association a compound of the present invention with the carrier and, 
optionally, one or more accessory ingredients. In general, the formulations iare prepared 



wo 97/35990 



PCT/US97/05275 



-77- 



by uniformly and intimately bringing into association a deacetylase inhibitor of the present 
invention with liquid carriers, or finely divided solid carriers, or both, and then, if 
necessary, shaping the product. 

Formulations of the invention suitable for oral administration may be in the form 
5 of capsules, cachets, pUls, tablets, lozenges (using a flavored basis, usually sucrose and 
acacia or tragacanth), powders, granules, or as a solution or a suspension in an aqueous 
or non-aqueous Uquid, or as an dil-in^water or water-in-oil liquid emulsion, or as an elbdr 
or syrup, or as pastilles (using an inert base, such as gelatin and glycerin, or sucrose and 
acacia) and/or as mouth washes and the like, each containing a predetermined amount of 
10 a compound of the present invention as an active ingredient. A deacetylase inhibitor of 
the present invention may also be administered as a bolus, electuary or paste. 

In solid dosage forms of the invention for oral administration (capsules, tablets, 
pills, dragees, powders, granules and the like), the active ingredient is mbced with one or 
more phaimaceutically-acceptable carriers, such as sodium citrate or dicalcium 

15 phosphate, and/or any of the following: (1) fillers or extenders, such as starches, lactose, 
sucrose, glucose, mannitol, and/or silicic acid; (2) binders, such as, for example, 
carboxymethylcellulose, alginates, gelatin, polyvinyl pyirolidone, sucrose and/or acacia; 
(3) humectants, such as glycerol; (4) disintegrating agents, such as agar-agar, calcium 
carbonate, potato or tapioca starch, alginic acid, certain siUcates, and sodium cari)onate; 

20 (5) solution retarding agents, such as paraffin; (6) absorption accelerators, such as 
quaternary ammonium compounds; (7) wetting agents, such as, for example, cetyl alcohol 
and glycerol monostearate; (8) absorbents, such as kaolin and bentonite clay; (9) 
lubricants, such a talc, calcium stearate, magnesium stearate, solid polyethylene glycols, 
sodium lauryl sulfate, and mbctures thereof; and (10) coloring agents. In the case of 

25 capsules, tablets and pills, the pharmaceuUcal compositions may also comprise buffering 
agents. SoUd compositions of a similar type may also be employed as fillers in soft and 
hard-filled gelatin capsules using such exdpients as lactose or milk sugars, as well as high 
molecular wdght polyethylene glycols and the like. 

A tablet may be made by compression or molding, optionally with one or more 
30 accessory ingredients. Compressed tablets may be prepared using binder (for example, 
gdatin or hydroxypropyimethyl ceUulose), lubricant, inert diluent, preservative, 
disintegrant (for example, sodium starch glycolate or cross-linked sodium carboxymethyl 
ceUulose), surface-active or dispersing agent. Molded tablets may be made by molding in 
a suitable machine a mixture of the powdered deacetylase inhibitor moistened with an 
35 inert liquid diluent. 
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The tablets, and other solid dosage forms of the pharmaceutical compositions of 
the present invention, such as dragees, capsules, pills and granules, may optionally be 
scored or prepared with coatings and shells, such as enteric coatings and other coatings 
well known in the pharmaceutical-fonmulating art. They may also be formulated so as to 
5 provide slow or controlled release of the active ingredient therein using, for example, 
hydroxypropylmethyl cellulose in varying proportions to provide the desired release 
profile, other«pdlymer matrices, liposomes and/or microspheres. They may be sterilized 
by, for example, filtration through a bacteria-retaining filter, or by incorporating 
sterilizing agents in the form of sterile solid compositions which can be dissolved in sterile 

10 water, or some other sterile injectable medium immediately before use. These 
compositions may also optionally contain opacifying agents and may be of a composition 
that they release the active ingredient(s) only, or preferentially, in a certain portion of the 
gastrointestmal tract, optionally, in a delayed manner. Examples of embedding 
compositions which can be used include polymeric substances and waxes. The active 

15 ingredient can also be in micro-encapsulated form, if appropriate, with one or more of the 
above-described excipients. 

Liquid dosage forms for oral administration of the deacetylase inhibitors of the 
invention include pharmaceutically acceptable emulsions, microemulsions, solutions, 
suspensions, syrups and elixirs. In addition to the active ingredient, the liquid dosage 
20 fonms may contain inert diluents commonly used in the art, such as, for example, water or 
other solvents, solubilizing agents and emulsifiers, such as ethyl alcohol, isopropyl 
alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene glycol, 
1,3-butylene glycol, oils (in particular, cottonseed, groundnut, com, germ, olive, castor 
and sesame oils), glycerol, tctrahydrofiiryl alcohol, polyethylene glycols and fatty acid 
^ 25 esters of sorbitan, and mixtures thereof 

Besides inert diluents, the oral compositions can also include adjuvants such as 
wetting agents, emulsifying and suspending agents, sweetening, flavoring, coloring, 
perfimiing and preservative agents. 

Suspensions, in addition to the active deacetylase inhibitor, may contain 
30 suspending agents as, for example, ethoxylated isostearyl alcohols, polyoxyethylene 
sorbitol and sorbitan esters, microcrystalline cellulose, aluminum metahydroxide, 
bentonite, agar-agar and tragacanth, and mixtures thereof 

Formulations of the pharmaceutical compositions of the invention for rectal or 
vaginal administration may be presented as a suppository, which may be prepared by 
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mixing one or more compounds of the invention with one or more suitable nonirritating 
excipients or carriers comprising, for example, cocoa butter, polyethylene glycol, a 
suppository wax or a salicylate, and which is soUd at room temperature, but liquid at 
body temperature and, therefore, will melt in the rectum or vaginal cavity and release the 
5 active deacetylase inhibitor. 

Formulations of the present invention which are suitable for vaginal administration 
also include pessaries, tampons, creams, gels, pastes, foams or spray formulations 
containing such carriers as are known in die art to be appropriate. 

Dosage forms for the topical or transdermal administration of a deacetylase 
10 inhibitor of this invention include powders, sprays, ointments, pastes, creams, lotions, 
gels, solutions, patches and inhalants. The active compound may be mbced under sterile 
conditions with a pharmaceutically-acceptable carrier, and with any preservatives, buflfers, 
or propellants which may be required. 

The ointments, pastes, creams and gels may contam, in addition to an active 
15 deacetylase inhibitor of this invention, excipients, such as animal and vegetable fats, oils, 
waxes, paraffins, starch, tragacanth, cellulose derivatives, polyethylene glycols, siUcones, 
bentoiutes, silicic add, talc and anc oxide, or nuxtures thereof. 

Powders and sprays can contain, in addition to a compound of this invention, 
excipients such as lactose, talc, silicic acid, aluminum hydroxide, calcium silicates and 
20 polyamide powder, or mixtures of these substances. Sprays can additionally contain 
customary propellants, such as chlorofluorohydrocariwns and volatile unsubstituted 
hydrocarbons, such as butane and propane. 

Transdermal patches have the added advantage of providing controlled delivery of 
a compound of the present invention to the body. Such dosage forms can be made by 
dissolving or dispersing the deacetylase inhibitor in the proper medium. Absorption 
enhancers can also be used to increase the flux of the deacetylase inhibitor across the 
skin. The rate of such flux can be controlled by either providing a rate controlling 
membrane or dispersing the deacetylase inhibitor in a polymer matrix or gel. 

Ophthalmic formulaUons, eye ointments, powders, solutions and the like, are also 
30 contemplated as bang within the scope of this invention. 

Pharmaceutical compositions of this invention suitable for parenteral 
administration comprise one or more deacetylase inhibitors of the invention in 
combination with one or more phannaceutically-acceptable sterile isotonic aqueous or 
nonaqueous solutions, dispersions, suspensions or emulsions, or sterile powders which 
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may be reconstituted into sterile injectable solutions or dispersions just prior to use, 
which may contain antioxidants, buffers, bacteriostats, solutes which render the 
formulation isotonic with the blood of the intended recipient or suspending or thickening 
agents. 

5 Examples of suitable aqueous and nonaqueous carriers which may be employed in 

the pharmaceutical compositions of the invention include water, ethanol, polyols (such as 
glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures 
thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl 
oleate. Proper fluidity can be maintained, for example, by the use of coating materials, 
1 0 such as lecithin, by the maintenance of the required particle size in the case of dispersions, 
and by the use of surfactants. 

These compositions may also contain adjuvants such as preservatives, wetting 
agents, emulsifying agents and dispersing agents. Prevention of the action of 
microorganisms may be ensured by the inclusion of various antibacterial and antifungal 
IS agents, for example, paraben, chlorobutanol, phenol sorbic acid, and the like. It may also 
be desirable to include isotonic agents, such as sugars, sodium chloride, and the like into 
the compositions. In addition, prolonged absorption of the injectable pharmaceutical form 
may be brought about by the inclusion of agents which delay absorption such as 
aluminum monostearate and gelatin, 

20 In some cases, in order to prolong the effect of a drug, it is desirable to slow the 

absorption of the drug from subcutaneous or intramuscular injection. This may be 
accomplished by the use of a liquid suspension of crystalline or amorphous material 
having poor water solubility. The rate of absorption of the drug then depends upon its 
rate of dissolution which, in turn, may depend upon crystal size and crystalline form. 

25 Alternatively, delayed absorption of a parenteraily-administered drug form is 
accomplished by dissolving or suspending the drug in an oil vehicle. 

Injectable depot forms are made by forming microencapsule matrices of the 
subject deacetylase inhibitors in biodegradable polymers such as polylactide- 
polyglycolide. Depending on the ratio of drug to polymer, and the nature of the particular 
30 polymer employed, the rate of drug release can be controlled. Examples of other 
biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot injectable 
formulations are also prepared by entrapping the drug in liposomes or microemulsions 
which are compatible with body tissue. 
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When the compounds of the present invention are administered as 
pharmaceuticals, to humans and animals, they can be given per se or as a pharmaceutical 
composition containing, for example. 0.1 to 99.5% (more preferably, 0.5 to 90%) of 
active ingredient in combination with a pharmaceutically acceptable carrier. 

5 The preparations of the present invention may be given orally, parenterally, 

topically, or rectally. They are of course given by forms suitable for each administration 
route. For example, they are administered in tablets or capsule form, by injection, 
inhalation, eye lotion, ointment, suppository, etc. administration by injection, infusion or 
inhalation; topical by lotion or ointment; and rectal by suppositories. Oral administration 
10 is preferred. 

These deacetylase inhibitor may be administered to humans and other animals for 
therapy by any suitable route of administration, including orally, nasally, as by, for 
example, a spray, rectally, intravaginally, parenterally, intracistemally and topically, as by 
powders, ointments or drops, including buccally and sublingually. 

15 Regardless of the route of administration selected, the compounds of the present 

invention, which may be used in a suitable hydrated form, and/or the pharmaceutical 
compositions of the present invention, are formulated into pharmaceuttcally-acccptable 
dosage forms by conventional methods knovm to those of skill in the art. 

Actual dosage levels of the active ingredients in the pharmaceutical compositions 
20 of this invention may be varied so as to obtain an amount of the active ingredient which is 
effective to achieve the desired therapeutic response for a particular patient, composition, 
and mode of administration, without being toxic to the patient. 

The selected dosage level will depend upon a variety of fectors including the 
activity of the particular deacetylase inhibitor employed, or the ester, salt or amide 
thereof, the route of administration, the time of administration, the rate of excretion of 
the particular compound being employed, the duration of the treatment, other drugs, 
compounds and/or materials used in combination with the particular deacetylase inhibitor 
employed, the age, sex, weight, condition, general health and prior medical histoiy of the 
patient being treated, and like factors well known in the medical arts. 

A physician or veterinarian having ordinary skill in the art can readily determme 
and prescribe the effective amount of the pharmaceutical composition required. For 
example, the physician or veterinarian could start doses of the compounds of the 
invention employed in the pharmaceutical composition at levels lower than that required 
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in order to achieve the desired therapeutic effect and gradually increase the dosage until 
the desired effect is achieved. 

Another aspect of the present invention relates to a method of inducing and/or 
maintaining a differentiated state, enhancing survival, and/or inhibiting (or alternatively 
5 potentiating) proliferation of a cell, by contacting the cells with an agent which modulates 
/fZ)x-dependent transcription. For instance, it is contemplated by the invention that, in 
light of the present finding of an apparently broad involvement of HDx proteins in the 
control of chromatin structure and, thus, transcription and replication, the subject method 
could be used to generate and/or maintain an array of different tissue both in vitro and in 
10 vivo. An "JTDx therapeutic," whether inhibitory or potentiating vnih respect to 
modulating histone deacetylation, can be, as appropriate, any of the preparations 
described above, including isolated polypeptides, gene therapy constructs, antisense 
molecules, peptidomimetics or agents identified in the drug assays provided herein. 

The HDx compounds of the present invention are likely to play an important role 
IS in the modulation of cellular proliferation. There are a wide variety of pathological cell 
proliferative conditions for which HDx therapeutics of the present invention may be used 
in treatment. For instance, such agents can provide therapeutic benefits where the 
general strategy being the inhibition of an anomalous cell proliferation. Diseases that 
might benefit from this methodology include, but are not limited to various cancers and 
20 leukemias, psoriasis, bone diseases, fibroproliferative disorders such as involving 
connective tissues, atherosclerosis and other smooth muscle proliferative disorders, as 
well as chronic inflammation. 

In addition to proliferative disorders, the present invention contemplates the use 
of HDx therapeutics for the treatment of differentiative disorders which result from, for 

2S example, de-dififerentiation of tissue which may (optionally) be accompaiued by abortive 
reentry into mitosis, e.g. apoptosis. Such degenerative disorders include chronic 
neurodegenerative diseases of the nervous system, including Alzheimer's disease, 
Parkinson's disease, Huntington's chorea, amylotrophic lateral sclerosis and the like, as 
well as spinocerebellar degenerations. Other differentiative disorders include, for 

30 example, disorders associated with connective tissue, such as may occur due to de- 
difierentiation of chondrocytes or osteocytes, as well as vascular disorders which involve 
de-differentiation of endothelial tissue and smooth muscle cells, gastric ulcers 
characterized by degenerative changes in glandular cells, and renal conditions marked by 
failure to differentiate, e.g. Wilm's tumors. 
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It will also be apparent that, by transient use of modulators ofHDx activities, in 
vivo reformation of tissue can be accomplished, e.g. in the development and maintenance 
of organs. By controlling the proliferative and differentiative potential for different cells, 
the subject HDx therapeutics can be used to reform injured tissue, or to improve grafting 
5 and morphology of transplanted tissue. For instance, HDx antagonists and agonists can 
be employed in a differential manner to regulate different stages of organ repair after 
physical, chemical or pathological insult. For example, such regimens can be utilized in 
repair of cartilage, increasing bone density, liver repair subsequent to a partial 
hepatectomy, or to promote regeneration of lung tissue in the treatment of emphysema. 
10 The present method is also applicable to cell culture techniques. 

In one embodiment, the HDx therapeutic of the present invention can be used to 
induce differentiation of uncommitted progenitor cells and thereby give rise to a 
committed progenitor cell, or to cause further restriction of the developmental fete of a 
committed progenitor cell towards becoming a terminally-differentiated cdL For 

15 example, the present method can be used in vitro or in vivo to induce and/or maintain the 
differentiation of hematopoietic cells into erythrocytes and other cells of the 
hematopoietic system. In an illustrative embodiment, the effect of erythropioetin (EPO) 
on the growth of EPO-responsive erythroid precursor cells is increased to influence their 
differentiation into red blood cells. For example, as a result of administering an inhibitor 

20 of histone deacetylation, the amount of EPO, or other diferentiating agent, required for 
growth and/or differentiation is reduced (PCT/US92/07737). Accordingly, the HDx 
therapeutics of the present invention, particularly those which antagonize HDx 
deacetylase activity, can be administered alone or in conjunction with EPO and in a 
suitable carrier to vertebrates to promote erythropoiesis. Alternatively, cells could be 

25 treated ex vivo. Such treatment is contemplated in the treatment of a variety of disease 
states, including in individuals who require bone marrow transplants (e.g. patients with 
aplastic anemia, acute leukemias, recurrent lymphomas, or solid tumors). 

To illustrate, prior to receiving a bone marrow transplant, a recipient is prepared 
by ablating or removing endogenous hematopoietic stem cells. Such treatment is usually 
30 carried out by total body irradiation or delivery of a high dose of an alkylating agent or 
other chemotherapeutic, cytotoxic agent, Anklesaria, et al. (1987) PNAS 84:7681-7685). 
Following preparation of the recipient, donor bone marrow cells are injected 
intravenously. Optionally, the HDx therapwtics of the present invention could be 
contacted with the cells ex vivo or administered to the subject with the reimplanted cells. 
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It is also contemplated that there may be cell-type specific HDx proteins, and/or 
that some cell types may be more sensitive to modulation of HDx deacetylase activities. 
Even within a cell type, the stage of diflferentiation or position in the cell cycle could 
influence their response to an HDx therapeutic. Accordingly, the present invention 
5 contemplates the use of agents which modulate histone deacetylase activity to specifically 
inhibit or activate certain cell types. In an illustrative example, T cell proliferation could 
be preferentially inhibited in order to induce tolerance by using a procedure similar to that 
for inducing tolerance using sodium butyrate (see, for example, PCT/US93/03045). To 
illustrate, the HDx therapeutics of the present invention may be used to induce antigen- 
specific tolerance in any situation in which it is desirable to induce tolerance, such as 
autoimmune diseases, in allogeneic or xenogeneic transplant recipients, or in graft versus 
host (GVH) reactions. According to the invention, tolerance vwU typically be induced by 
presenting the tolerizing compound (e.g., an HDx inhibitor) substantially 
contemporaneously with the antigen, i.e. reasonably close together in time with the 
antigen. In preferred embodiments the HEhc therapeutic will be administered after 
presentation of the antigen, so that they will have their eflfect after the particular 
repertoire of Th cells begins to undergo clonal expansion. 

Yet another aspect of the present invention concerns the application of HDx 
therapeutics to modulating morphogenic signals involved in organogenic pathways. 
Thus, it is contemplated by the invention that compositions comprising HDx therapeutics 
can also be utilized for both cell culture and therapeutic methods involving generation 
and maintenance of tissue. 

In a further embodiment of the invention, the subject HDx therapeutics vwll be 
useful in increasing the amount of protein produced by a cell or recombinant cell. The 
cell may include any primary cell isolated fi-om any animal, cultured cells, immortalized 
cells, and established cell lines. The animal cells used in the present invention include 
cells which intrinsically have an ability to produce a desired protdn; cells which are 
induced to have an ability to produce a desired protein, for example, by stimulation with 
a cytokine such as an interferon, an interieukin; genetically engineered cells into which a 
gene for a desired protein is introduced. The protein produced by the process could 
include any peptides or proteins, including peptide hormone or protdnaceous hormones 
such as any useful hormone, cytokine, interieukin, or protein which it may be desirable to 
have in purified form and/or in large quantity. 

Another aspect of the invention features transgenic non-human animals which 
express a heterologous HDx gene of the present invention, or which have had one or 
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more genomic HDx genes disrupted in at least one of the tissue or cell-types of the 
animal. Accordingly, the invention features an animal model for developm«ital diseases, 
which animal has one or more HDx allele which is mis-expressed. For example, a mouse 
can be bred which has one or more HDx alleles deleted or otherwise rendered inactive. 
Such a mouse model can then be used to study disorders arising from mis-expressed HDx 
genes, as well as for evaluating potential therapies for similar disorders. 

Another aspect of the present invention concerns transgenic animals which are 
comprised of cells (of that animal) which contain a transgene of the present invention and 
which preferably (though optionally) express an exogenous HDx protein in one or more 
cells in the animal. An HDx transgene can encode the wild-type form of the protein, or 
can encode homologs thereof, including both agonists and antagonists, as well as 
antisense constructs. In preferred embodiments, the expression of the transgene is 
restricted to specific subsets of cells, tissues or developmental stages utilizing, for 
example, cis-acting sequences that control expression in the desired pattern. In the 
15 present invention, such mosaic expression of an HDx protein can be essential for many 
forms of lineage analysis and can additionally provide a means to assess the eflfects of, for 
example, lack oiHDx expression which might grossly alter development in small patches 
of tissue within an otherwise normal embryo. Toward this and, tissue-specific r^latory 
sequences and conditional r^latory sequences can be used to control expression of the 
20 transgene in certain spatial patterns. Moreover, temporal patterns of expression can be 
provided by, for example, conditional recombination systems or prokaryotic 
transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be regulated 
via site-specific genetic manipulation in vivo are known to those skilled m tiie art. For 
25 instance, genetic systems are available which allow for the regulated expression of a 
recombinase that catalyzes the genetic recombination a target sequence. As used herein, 
the phrase "target sequence" refers to a nucleotide sequence that is genetically 
recombined by a recombinase. The target sequence is flanked by recombinase 
recognition sequences and is generally either excised or inverted in cells expressing 
recombinase activity. Recombinase catalyzed recombination events can be designed such 
that recombmation of the target sequence results in eitiier the activation or repression of 
expression of one of the subject HDx proteins. For example, excision of a target 
sequence which interferes with the expression of a recombinant HDx gene, such as one 
which encodes an antagonistic homolog or an antisense transcript, can be designed to 
activate expression of that gene. This interference with expression of tiie protein can 
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result from a variety of mechanisms, such as spatial separation of the HDx gene from the 
promoter element or an internal stop codon. Moreover, the transgene can be made 
v^herein the coding sequence of the gene is flanked by recombinase recognition sequences 
and is initially transfected into cells in a 3* to 5' orientation wth respect to the promoter 
5 element. In such an instance, inversion of the target sequence will reorient the subject 
gene by placing the 5* end of the coding sequence in an orientation with respect to the 
promoter element which allow for promoter driven transcriptional activation^ 

In an illustrative embodiment, either the cre/loxP recombinase system of 
baaeriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 

10 89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman 
et al. (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to 
generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes 
the site-specific recombination of an intervening target sequence located between loxP 
sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre 

15 recombinase binds and are required for Cre recombinase mediated genetic recombination. 
The orientation of loxP sequeifces determines whether the intervening target sequence is 
excised or inverted when Cre recombinase is present (Abremski et al. (1984) J. BioL 
Chem. 259:1509-1514); catalyzing the excision of the target sequence when the loxP 
sequences are oriented as direct repeats and catalyzes inversion of the target sequence 

20 when loxP sequences are oriented as inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by 
promoter elements which are subject to regulatory control, e.g., tissue-specific, 
developmental stage-specific, inducible or repressible by externally added agents. This 
25 regulated control will result in genetic recombination of the target sequence only in cells 
where recombinase expression is mediated by the promoter element. Thus, the activation 
expression of a recombinant HDx protein can be regulated via control of recombinase 
expression. 

Use of the cre/loxP recombinase system to regulate expression of a recombinant 
30 HDx protein requires the construction of a transgenic animal containing transgenes 
encoding both the Cre recombinase and the subject protein. Animals containing both the 
Cre recombinase and a recombinant HDx gene can be provided through the construction 
of "double" transgenic animals. A convenient method for providing such animals is to 
mate two transgenic animals each containing a transgene, e.g., an HDx gene and 
3 5 recombinase gene. 
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One advantage derived from initially constmcting transgenic animals containing 
an HDx transgene in a recombinase-mediated expressible format derives from the 
likelihood that the subject protein, whether agonistic or antagonistic, can be deleterious 
upon expression in the transgenic animal. In such an instance, a founder population, in 
5 which the subject transgene is silent in all tissues, can be propagated and maintained. 
Individuals of this founder population can be crossed with animals expressing the 
recombinase in, for example, one or more tissues and/or a desired temporal pattern 
Thus, the creation of a founder population in which, for example, an antagonistic HDx 
transgene is silent will allow the study of progeny from that founder in which disruption 
1 0 of HDx mediated induction in a particular tissue or at certain developmental stages would 
result in, for example, a lethal phenotype. 

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneous expressed in order to 
facilitate expression of the HDx transgene. Exemplary promoters and the corresponding 
15 trans-activating prokaryotic proteins are given in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by gene 
therapy-like methods wherein a gene encoding the trans-activating protein, e.g. a 
recombinase or a prokaryotic protein, is delivered to the tissue and caused to be 
expressed, such as in a cell-type specific manner. By this method, an HDx transgene 
20 could remain silent into adulthood until "turned on" by the introduction of the trans- 
activator. 

In an exemplary embodiment, the "transgenic non-human animals" of the 
invention are produced by introducing transgenes into the germline of the non-human 
animal. Embryonic target cells at various developmental stages can be used to introduce 

25 transgenes, Diflferent methods are used depending on the stage of development of the 
embryonic target cell. The zygote is the best target for micro-injection. In the mouse, the 
male pronucleus reaches the size of approximately 20 micrometers in diameter which 
allows reproducible injection of l-2pl of DNA solution. The use of zygotes as a target for 
gene transfer has a major advantage in that in most cases the injected DNA will be 

30 incorporated into the host gene before the first cleavage (Brinster et al. (1985) PNAS 
82:4438-4442). As a consequence, all cells of the transgenic non-human animal will carry 
the incorporated transgene. This will in general also be reflected in the efficient 
transmission of the transgene to offspring of the founder since 50% of the germ cells will 
harbor the transgene. Microinjection of zygotes is the preferred method for incorporating 

35 transgenes in practicing the invention. 
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Retroviral infection can also be used to introduce HDx transgenes into a non- 
human animal. The developing non-human embryo can be cultured in vitro to the 
blastocyst stage. During this time, the blastomeres can be targets for retroviral infection 
(Jaenich, R. (1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is 
5 obtained by en2ymatic treatment to remove the zona pellucida (Manipulatmg the Mouse 
Embryo, Hogan eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). 
The viral vector system used to introduce the transgene is typically a replication-defective 
retrovirus carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der 
Putten et al. (1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained 

10 by culturing the blastomeres on a monolayer of virus-producing cells (Van der Putten, 
supra; Stewart et al. (1987) EMBO J. 6:383-388). Alternatively, infection can be 
performed at a later stage. Virus or virus-producing cells can be injected into the 
blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the founders will be 
mosaic for the transgene since incorporation occurs only in a subset of the cells which 

15 formed the transgenic non-human animal. Further, the founder may contain various 
* retroviral insertions of the transgene at different positions in the genome which generally 
will segregate in the offspring. In addition, it is also possible to introduce transgenes into 
the germ line by intrauterine retroviral infection of the midgestation embryo (Jahner et al. 
(1982) supra). 

20 A third type of target cell for transgene introduction is the embryonic stem cell 

(ES), ES cells are obtained from pre-implantation embryos cultured in vitro and fiised 
with embryos (Evans et al. (1981) Nature 292:154-156; Bradl^r et al. (1984) Nature 
309:255-258; Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) 
Nature 322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA 

25 transfection or by retrovirus-mediated transduction. Such transformed ES cells can 
thereafter be combined with blastocysts from a non-human animal. The ES cells 
thereafter colonize the embryo and contribute to the germ line of the resulting chimeric 
animal. For review see Jaenisch, R, (1988) Science 240: 1468-1474. 

Methods of making HDx knock-out or disruption transgenic animals are also 
30 generally known. See, for example, Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N,Y., 1986). Recombinase dependent 
knockouts can also be generated, e.g. by homologous recombination to insert 
recombinase target sequences flanking portions of an endogenous HDx gene, such that 
tissue specific and/or temporal control of inactivation of an HDx allele can be controlled 
35 as above. 
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Exemplificaiion 

The invention, now being generally described, will be more readily understood by 
reference to the following examples, which are included merely for purposes of 
5 illustration of certain aspects and embodiments of the present invention and are not 
intended to limit the invention, - - 

Example I 

Trapoxin is a microbialiy derived cyclotetrapeptide that inhibits histone 
10 deacetylation in vivo and causes mammalian cells to arrest in the cell cycle. A trapoxin 
aflSnity matrix was used to isolate two nuclear proteins that copurified with histone 
deacetylase activity. Both protdns were identified by peptide microsequencing, and a 
cDNA encoding the histone deacetylase catalytic subunit (HDJ) was cloned from a 
Jurkat T cell library. As the predicted protein is highly similar to the yeast transcriptional 
15 regulator RPD3, this study supports a role for histone deacetylase as a key regulator of 
eukaryotic transcription. 

A requu-ement for a functional histone deacetylase in cell cycle progression has 
been implicated by the discovery that two cytostatic agents, trapoxin and trichostatin 
(Figure lA), inhibit histone deacetylation in cultured mammalian cells and in fractioned 
20 cell extracts (4). In addition to causing G| and G2 phase cell cycle arrest, these natural 
products alter gene expression and induce certain mammalian cell lines to differentiate. 
Whereas sodium butyrate also has these properties, both trapoxin and trichostatin are five 
orders of magnitude more potent. 

Trapoxin is an "irreversible" inhibitor of histone deacetylase activity and its 
25 molecular structure oflfers clues as to how it could form a covalent bond with a 
nucleophilic active site readue. First, trapoxin contains an electrophilic epoxyketone that 
is essential for biological activity (5). Second, the aliphatic epox>icetone side chain is 
approxinutely isosteric with N-acetyl lysine (Figure lA). Trapoxin likely acts as a 
substrate mimic, with epoxyketone poised to alkylate an active site nucleophile. We 
30 therefore regarded trapoxin as a tool that could reveal the molecular identity of histone 
deacetylase, so that its role in transcriptional regulation and cell cycle progression could 
be elucidated. 
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Tritium-labeled trapoxin was prepared by total synthesis and used to identify 
trapoxin binding protein in crude extracts from bovine thymus. We used a charcoal 
precipitation assay to detect a specific trapoxin binding activity primarily in the nuclear 
fraction of the extracts (6). The binding activity was saturable with nanomolar 
5 concentrations of [^H]trapoxin and was completed by the simultaneous addition of 
unlabled trapoxin. Trichostatin also competed with [^H]trapoxin (for synthesis, see 
Example 2), suggesting that both of these compounds exert their cellular effects by 
targeting the same molecule. 

If trapoxin and trichostatin induce cell cycle arrest by directly inhibiting histone 
10 deacetylase, then the binding and enzymatic activities should copurify. To investigate 
this possibility, we fractioned nuclear thymus proteins by ammonium sulfate precipitation 
and Mono Q anion exchange chromatography. 

Briefly, thymocytes {--12 g) prepared from fresh bovine thymus were 
homogenized in hypotonic lysis buffer [20 mM tris (pH 7.8), 20 mM NaCl, 1 mM EDTA, 

15 10% glycerol, ImM PMSF, ImM benzamidine, 10 jig/ml each of pepstatin, aprotinin, 
and leupeptin] by mechanical disruption and the nuclei were isolated by centrifiigation at 
3000g. Nuclei were resuspended in lysis buffer and the proteins were extracted wth 0.4 
M ammonium sulfate. The viscous lysate was sonicated and clarified by centrifiigation at 
100,000g for one hour. Proteins were then precipitated with 90% saturated anunonium 

20 sulfate and recovered by centrifiigation (100,000g, one hour). After through dialysis 
against Q buffer (25 mM tris pH 8, 10 mM NH CI, 0.25 mM EDTA, 10% glycerol), a 
portion of the nuclear proteins (-12 mg total protein) was loaded onto a HR 10/10 Mono 
Q column (Pharmacia). The column was washed with 25ml Q bufifer and eluted with a 
50 ml linear gradient of 10 to 500 mM NH4 CI. The column was further washed with 25 

25 ml 500 mM NH4 and 25 ml 1 M histone deacetylase activities or further purified with 
the K-trap affinity matrix. All procedures were done at 4^, 

Two peaks of histone deacetylase activity eluted from the Mono Q column 
between 250 and 350 mM NH4CI (Figure IB). Trapoxin binding activity, as revealed by 
the charcoal precipitation assay (40 nM pHQtrapoxin), precisely coeluted with the histone 
30 deacetylase peaks. Furthermore, all detectable histone deacetylase activity was abolished 
by treatment vath either trapoxin or trichostatin (20 nM). Similar results were obtained 
with Mono Q firactioned nuclear extracts prepared form human Jurkat T cells. 

To purify the histone deacetylase further, we synthesized an affinity matrix based 
on the trapoxin structure. Because trapoxin itself is not amenable to derivatization and 
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the epoxyketone side chain is indispensable for activity, we chose to replace one of the 
phenylalanine residues of trapoxin's cyclic core with a lysine that could then be covalently 
linked to a solid support. This molecule, which we call K-trap, was prepared by a twenty 
step synthesis starting with commercially available (R)-proline and (S,S)- threiitol 
5 acetonide (Figure 2A) (see Example 3). Synthetic K-trap inhibited pH]thymidine 
incorporation in MG-63 human osteosarcoma cells with a potency approximately one 
tenth that of trapoxin. In vitro histone deacetylasljiactivity was also inhibited potently by 
this compound (complete ihactivation at 20 nM) (8). 

K-trap was deprotected with Pd(Ph3P)4 and coupled to an activated agarose 
10 matrix (Figure 2 A). Mono Q fractions containing nuclear proteins from bovine thymus 
were incubated with the K-trap affinity matrix and then tested for both trapoxin binding 
and histone deacetylase activity. Both activities were depleted (90%) by treatment with 
the K-trap matrix, yet a control matrix capped with ethanolamine had no efifect on either 
activity (8). Bound polypeptides were eluted by boiling the matrix in 1% SDS buffer and 
15 separated b polyacrylamide gel electrophoresis. In vitro binding experiments with soluble 
pHJtrapoxin indicated that the radiolabel is released into solution following protein 
denaturation with SDS or gunaidinium hydrochloride. Thus, trapoxin binding proteins 
were expected to elute from the aflSnity matrix with SDS. 

The silver stained gel of the affinity matrix eluates revealed sbc major polypeptides 
20 with apparent molecular sizes between 45 and 50kD (Figure 2B). The interaction 
between bovine p46-p50 and the K-trap matrix appeared to be specific, because these 
proteins were not retained when the incubation was done in the presence of either 
trapoxin or trichostatin (Figure 2B), nor were they structurally unrelated histone 
deacetylase inhibitor, trichostatin, to prevent p46-p50 from binding to the K-trap matrix 
25 implies that one or more of these polypeptides constitute the biologically relevant protein 
target of both trapoxin and trichostatin. When the affinity purification was repeated with 
Jurkat nuclear extracts, only two major bands, p50 and p55, were observed by silver 
staining (Figure 2B). Recovery of human p50 and p55 was similarly abolished by 
trapoxin (Figure 2B) and trichostatin (8). Because the relative intensities of bovine p46- 
30 p49 vary with each protein preparation, we suspect that they are proteolytic fragments 
derived from the bovine equivalent of human p55. One of the bands (p50) is common to 
both human and bovine sources. 

Large scale purification of the bovine proteins led to the resolution of two major 
bands of and --50 kD in the final preparative electrophoresis step, both of which 
35 were submitted for microsequencing. 
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To obtain enough trapoxin binding protein for microsequencing, nuclear 
ammonium sulfate pellets from 15 bovine thymuses were prepared as described above. 
Sedimented proteins were resuspended in and dialyzed against buflFer A [20 mM bistris 
(pH 7.2), 20 mM NaCI, 10% glycerol] for 12 hours, and brought to pH 5.8 by dialyzing 
5 against bugger A (pH 5.8) for 30 minutes. After centrifugation, the dialysate (--650 mg 
protein) was loaded onto a Q Sepharose FF colunm (2.6 x 10 cm; Pharmacia) and the 
column was washed with4'20^ml buflFer A (pH 5.8). Proteins wee eluted with a 400 ml 
linear gradient of 20 to 600 mM NaCl in buflfer A. Fractions (10 ml; each fraction 
contained 1 ml of 1 M tris pH 8 to neutralize the addic buffer A) were assayed for 

10 trapoxin binding activity. Tween-20 was added to active fractions at a final 
concentration of 0.05%, and these fractions were incubated with K-trap affinity matrix 
for 16 hours (25 fil per ml Q fraction). After washing the matrix three times with 
phosphate buffered saline, bound proteins were eluted by boiling in 40 ^1 of SDS sample 
buffer per 25 ^1 of matrix. SDS eluates were combined and the proteins resolved by 

15 SDS-PVDF membrane (Biorad). Staining with Ponceau S revealed two nugor bands (46 
and 50 kD). The excised bands were proteolytically digested and the HPLC purified 
peptide fragments were sequenced at the Harvard Microchemistry Facility. 

The bovine protein of larger molecular size (-50 kD) corresponds to a known 
protein, RbAp48 (1 1), that consists of seven WD repeat domains (12). Originally 
20 identified as a protein that binds to the retinoblastoma gene product (pRb), RbAp48 may 
constitute an adaptor subunit that targets the histone deacetylase to specific chromatin 
domains. 

The --46 kD bo^nne protein is highly related to the protein encoded by the yeast 
RPD3 gene, which has been implicated by several genetic screens as a transcriptional 

25 regulator, but whose biochemical fimction is unknown (13). Partial cDNA sequences for 
the human gene were identified in the expressed sequence tag database (dbEST) and 
were used to design polymerase chain reaction (PGR) primers. Briefly, after noting 
sequence similarity between peptides derived from the purified bovine trapoxin binding 
protein and yeast RPD3, we checked dbEST to see whether any partial sequences for the 

30 human homologue had been reported. Two ESTs (Genbank accession numbers: 
D31480 and F07807) were identified whose predicted translation products aligned with 
high sequence similarity to NH2- and COOH-terminal re^ons ofHDJ, respectively, PGR 
primers were designed based on these tags and a one kilobase PGR product was obtained 
from a Jurkat cDNA library (Stratagene). A ^^P labeled probe prepared by random 

35 priming was used to screen the Jurkat library, and ten positive clones were isolated. One 
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of the clones was fully sequenced and found to contain a putative fiiU-length open 
reading frame (Figure 3A). The peptide sequences obtained from the purified bovine 
protein align with 100% identity to sequences deduced from this coding region (Figure 
3A, boxed residues). We call this human protein HDl (for histone deacetylase), and its 
5 predicted size of 55 kD agrees well with the est'unated size of p55 isolated from Jurkat 
nuclear extracts using the K-trap affinity matrix (Figure 2B). A dbEST search indicated 
the existence of at least two other related human genes. 

To determine the relationship between the protdns from bovine thymus (p46- 
p50) and the proteins isolated from human Jurkat T cells (p50 and p55), an antiserum 

10 was generated against a peptide specified by the HDl open reading frame (Figure 3A, 
amino acids 319 to 334). Immunoblot analysis of the bovine proteins p46-p49 and the 
human protein p55 showed that they all react with the andserum and provides additional 
evidence that these bands correspond to bovine and human HDl (Figure 3B). A 
monoclonal antibody that specifically recognizes RbAp48 was used to confirm the 

15 identity of bovine and hum p50. Importantly, neither HDl nor RbAp48 was detected 
when the affinity purification was done in the presence of trapoxin or trichostatin (Figure 
3B), 

We used affinity purified antibodies directed against a COOH-terminal peptide 
(amino acids 467 to 482) to immunoprecipitate HDl from crude nuclear extracts. The 

20 immunoprecipitates contained histone deacetylase activity that was inhibited by both 
trapoxin and trichostatin (Figure 4A). Consistent with the idea that HDl and RbAp48 
form a complex in vivo, the two proteins copredpitated with the anti-DHl antibodies 
(Figure 4B). Neither HDl^ RbAp48, nor the associated histone deacetylase activity were 
immunoprecipitatcd in the presence of the HDl COOH-terminal peptide (Figure 4A and 

25 4B) (15). HDl, like RbAp48 (11), is detected predominantly in the nucleus by 
immunostaining vrith the aforementioned antibodies (8). Given that HDl and RbAp48 
are the major proteins eluted from the K-trap matrbc (Figure 2B), it is likely that they 
interact directly with one another. 

We extended the results obtained with the endogenous protein by expressing 
30 recombinant FLAG epitope tagged HDl (HDl-F) in Jurkat T cells. Anti-FLAG 
immunoprecipitates from cells transfected with pBJ5/M)/-F contained histone 
deacetylase activity that was sensitive to both trapoxin and trichostatin figure 4C). 
Histone deacetylase activity was not precipitated when the antibody was blocked with 
excess FLAG peptide (15). Interestingly, endogenous RbAp48 did not coprecipitate with 
35 overexpressed HDl-F (8), demonstrating that RbAp48 is not required for either histone 
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deacetylase or trapoxin binding activity. The result is consistent with the idea that 
RbAp48 serves a targeting rather than an enzymatic function. Finally, lysates from cells 
transfected with pBJ5/HDJ-F were incubated with the K-trap affinity matrix in the 
presence or absence of trapoxin and trichostatin. Protein immunoblot analysis 
5 demonstrated an interaction between recombinant HDJ-F and the K-trap affinity matrix 
that was fully competed by nanomolar concentrations of trapoxin or trichostatin (Figure 
4D). 

HDJ is 60% identical to the protein encoded by the yeast RPD3 gene, which was 
isolated in four independent mutant suppressor screens designed to identify 

10 transcriptional repressors (13, 16, 17, 18, 19). No biochemical function for the yeast 
protein has previously been postulated. A negative regulator of the TRK2 gene, RPD3 is 
necessary for the transcriptional repression of several genes whose expression is 
regulated according to specific environmental conditions. Loss of RPD3 also leads to 
decreased transcriptional activation of certain genes, but this effect may be indirect (13, 

15 17). Although RPD3 had yet to be implicated in silencing at telmomeres or the mating 
loci, the fact that silencing is eliminated by point mutations in specific lysine residues near 
the NH2-terminus of histones H3 and H4 suggests that lysine deacetylation may 
contribute to the maintenance of silenced chromatin (20, 21, 22, 23). Indeed, silencing at 
telomeres and the mating loci has been correlated with the presence of hypoacetylated 

20 histones, and sir mutants which are defective in silencing show a corresponding increase 
in the extent of histone acetylation at these loci (24). The SIR3 and SIR4 proteins have 
been shown to interact with a bacterially expressed histone H4 NH2*^®™™^ domain in 
vitro (25), and it is possible that deacetylation of one or more lysine residues is required 
for this interaction in vivo. Our results further support a role for histone deacetylase as a 

25 transcriptional regulator and establish a biochemical connection to the genetic studies that 
originally characterized RPD3, 

How does inhibition of histone deacetylase in mammalian cells lead to G| and G2 
phase cell cycle arrest? One possibility is that specific cell cycle regulatory proteins such 
as the cyclin dependent kinase inhibitors are transcriptionally upregulated in response to 
30 histone deacetylase inactivation. Alternatively, cell cycle checkpoints may exist that 
monitor histone acetylation or higher-order chromatin structure. It should now be 
possible to study the regulation of histone deacetylase during the cell cycle, its substrate 
specificity, and the mechanism by which it is targeted to specific regions of the genome. 
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Example 2 

3H-Trapoxin was prepared from (S,S)-threitoI acetonide (9) by total synthesis, as 
outlined in Figures 8A-8C. 

As shown in Figure 8A, (S,S)-threitol acetonide (9) was monoprotected by 
5 treatment with triisopropylsilylchloride (TIPSCl) and sodium hydride in tetrahydrofiiran 
(THF). The free alcohol was then subjected to Swem oxidation, Wittig reaction of the 
resulting aldehyde gave compound 10 in good yield for the three steps. Compound 10 
was then hydrogenated with deprotection of the primary alcohol, which was then 
converted to the bromide 11 in excellent yield. Bromide 11 was converted to the 
10 organocuprate and reacted with (S)-serine P-lactone to yield the benzyloxycarbonyl- 
(Cbz) protected amino acid 12. 

As shown in Figure 8B, 12 was coupled to tripeptide methyl ester 14, and the 
methyl ester was saponified. The amino acid was then cyclized and the silyl protecting 
group was removed to yield cyclotetrapeptide 18 in 51% yield. 

15 Cyclotetrapeptide 18 was tritiated, as shown in Figure 8C, by oxidation of the 

primary alcohol with the Dess-Martin reagent, and the aldehyde was reduced with 
tritiated sodium borohydride to provide tritiated 18, which was converted to 
[3H]Trapoxin B by tosylation of the primary alcohol, deprotection of the diol, epoxide 
ring closure, and oxidation of the secondary alcohol to yield the desired compound. 

20 Non-radiolabelled 18 was converted to [^HjTrapoxin B, via tosylate 19, in 68% overall 
yield. 

Example 2 

K-Trap was prepared from (S,S>-threitol acetonide (9) by total synthesis, as 
25 outlined in Figures 9A-9C. As shown in Figure 9A, monoprotection and Swm 
oxidation of 9 yielded the aldehyde as above. Wittig homologation yielded carboxylic 
acid 20, which was converted to the mbced anhydride and treated with lithiated 
oxazolidinone 21 to provide 22 in excellent yield. Deprotection of the primary alcohol 
and conversion to the tosylate were followed by treatment of the potassium enolate with 
30 trisylazide according to the method of Evans to eflfect electrophilic aade transfer in good 
overall yield and stereoselectivity, providing compound 23. Removal of the chiral 
auxiliary and catalytic reduction of the azido function, with hydrogenation of the olefin, 
provided amino acid 24. which was N-protected to give the Fmoc derivative 25 in high 
overall yield. 
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Referring to Figure 9B, protected amino acid 25 was coupled to tripeptide methyl 
ester 26. The methyl ester was saponified to ^eld 27, which was cyclized under high- 
dilution conditions to provide cyclotetrapeptide 28 in 58% yield.. 

As shown in Figure 9C, compound 28 was converted to K-trap (29) by 
5 deprotection of the diol, base-promoted epoxide closure, and oxidation of the secondary 
alcohol to provide K-trap (29) in good overall yield. The K-trap afOnity matrix 30 was 
provided by palladium-catalyzed removal of the allyloxycarbonyl (Alloc) group from the 
lysine residue of 29, and immobilization on AfBgel 10. 
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- - ^ Example 3 

Histone Deacetylase Activity is Required 
for Full Transcriptional Repression by mSin3 A 

10 

The Mad family of basic region-Helix-Loop-Helix-Leucine Zipper (BHLHZip) 
proteins play an important role in controlling cell proliferation and diflferentiation (for 
reviews see: Amati and Land, 1994; Bernards 1995). Four identified Mad family 
members: Madl, Mxil, Mad3 and Mad4 (Ayer et al., 1993; Zervos et al., 1993; Hurlin et 
15 al., 1995a) form heterodimers with another BHLHZip protein, Max; to repress 
transcription (Ayer et al., l993;,Hurlin et al., 1995a) and are thought to play a negative 
role in the control of cell proliferation. 

Two mammalian homologs of the Saccharomyces cerevisiae transcriptional 
corepressor SIN3, mSin3 A and mSin3B, have recently been identified as Mad interacting 

20 proteins and are required for Mad-mediated transcriptional repression (Ayer et al., 1995; 
Schreiber-Agus et al., 1995). The most conserved regions of these proteins correspond 
to four putative paired amphipathic helix (PAH) motifs, which have been proposed to 
constitute protein-protein interaction surfaces (Wang et al., 1990). The second PAH 
motif in mSin3A, mSin3B and Sin3p interacts with the mSin3 interaction domain or SID 

25 in the amino terminus of the four Mad family members (Ayer et al., 1995; Schreiber-Agus 
et al., 1995; Hurlin et al., 1995a; Kasten et al., 1996). Madl, Max and mSin3A form 
ternary complexes capable of binding DNA (Ayer et al., 1995). Point mutations in the 
SID domain of Madl disrupt its ability to bind mSinSA, negate its fimction as a 
trancriptional repressor (Ayer et al., 1995), and eliminate Madl function in several 

30 biological assays (Koskinen et al., 1995; Roussel et al., 1996). These findings suggest 
that Mad:Max heterocomplexes repress transcription by tethering either mSin3A or 
mSin3B to DNA. A chimeric protein fixsing the SID of Madl to the GAL4 DNA-binding 
domain results in repression of simple and complex promoters in a manner that is 
dependent on mSin3 binding, suggesting that targeting mSin3 to DNA is necessary for 
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repression (Ayer et al., 1996). Nevertheless, the molecular mechanisni(s) for mSin3A— 
mediated repression remain unknown. 

As described in example 1, a mammalian histone deacetylase has been identified, 
and cDNAs encoding the protein, histone deacetylase 1 (HDl or HDACI for the 
5 purposes of this example), have been cloned (see also Taunton et al., 1996b). HDACI is 
approximately 60% identical to the 5. cerevisiae RPD3 protein, which is a component of 
a yeast histone deacetylase complex (Rundlett et al., 1996). Single mutations in either 
RPD3 or SIN3 give the same phenotypes as RPD3/SIN3 double, mutants suggesting that 
they function in the same pathway (Stillman et al., 1994). Because Mad family proteins 

10 use mSin3A as a corepressor and Madl can repress transcription in wild-type yeast but 
not yeast having a null mutation in SIN3 (Kasten et al., 1996) or RPD3 (D.J. Stillman, 
personal comm.), it is likely that the mechanism of transcriptional repression by Mad 
proteins may be conserved between yeast and higher eukaryotes. Consistent with this 
hypothesis, the DNA-binding transcription factor YYl interacts with a mammalian RPD3 

15 homolog to repress transcription of a heterologous reporter gene (Yang et. at, 1996). 
These results demonstrate that mammalian RPD3-like activity functions in transcriptional 
regulation. 

Several lines of evidence suggest that the acetylation status of conserved lysines in 
the amino terminal domains of histones H3 and H4 play a role in the regulation of 

20 transcription. In general, histone hyperacetylation correlates with transcriptionally active 
or poised genes; conversely, hypoacetylation correlates with transcriptionally repressed 
heterochromatin (for reviews see: Turner, 1993; Loidl, 1994; Wolflfc, 1996). While Uttle 
is known about the targeting and regulation of histone acetyltransferases and 
deacetylases, it has been recently shown that several transcriptional coactivators possess 

25 inherent acetyltransferase activity (Brownell et al., 1996; Ogiyzko et al., 1996) or 
associate with acetyltransferases (Yang et al., 1996b). We report that mSin3A and 
HDACI associate in vivo and that the histone deacetylase inhibitor trapoxin interferes 
with mSin3 A-mediated transcriptional repression. 

30 Results 

(i) mSin3 A is present in cells as a large stable multiprotein complex. 

To study the in vivo function of mSin3A we generated polyclonal antiserum 
specific for the PAH2 domain of mSin3A. We tested this antiserum by 
inununoprecipitation using nuclear lysates made from the myeloid leukemia cell line U937 
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that had been metabolically labeled with ^^S-methionine. Analysis of inununoprecipitates 
showed an intensely-labeled doublet with an apparent molecular weight of 150 
kiloDaltons that was present in the anti-mSin3A inununoprecipitates (Figure lOA). This 
doublet comigrated with in vitro translated-mSinSA, shared identical V8 protease 
5 digestion peptides with in vitro translated-Sin3A and was absent from 
unmunoprecipitations using preimmune senim or immune serum preincubated with the 
cognate immunogen (data not shown). 

Fractionation of U937 nuclear extracts by size exclusion chromatography 
indicated that mSinSA is present in large molecular weight complex(es) (D.E.A., 

10 unpublished). To address this possibility, we performed immunoprecipitations from 
metabolically-labeled U937 cells under conditions that should preserve protdn-protein 
interactions. In addition to mSinSA, the low-stringency mSm3A immunoprecipitates 
contsdned several labeled polypeptides of apparent molecular weight 250 IcDa, 180 kDa, 
55 kDa, 50 kDa, 42 kDa, 33-36 kDa and 30 kDa (Figure IDA). These proteins were not 

15 detected in immunoprecipitates using mSin3A antiserum blocked with the cognate 
immunogen, suggesting that the proteins detected are specifically associated with 
mSin3A Furthermore, none of these proteins were detected using high-stringency 
immunoprecipitation or by western blotting of whole-cell lysates using anti-Sin3A, 
suggesting that they do not share epitopes with mSin3A and are not proteolytic 

20 breakdown products of mSin3 A (data not shown). All of the associated proteins appear 
to be present in substoichiometric amounts to mSin3A, suggesting that mSin3A 
complexes are heterogeneous. 

To test the stability of the mSin3 A complex, we subjected low-stringency mSin3 A 
immunoprecipitates to different salt concentrations and ionic detergent conditions. The 

25 proteins that remained bound to mSin3 A in the immunecomplex were analyzed by SDS- 
PAGE. Under the most stringent conditions we observed only a slight loss of mSin3A- 
associated proteins in the immune complex (Figure lOB). One exception to this finding 
was the apparently quantitative loss of p42 under slightly-elevated salt concentrations. 
These findings demonstrate that the mSin3 A complex is stable in vivo and suggests that 

30 some or all of the mSin3A-associated proteins may facilitate mSin3A fijnction as a 
transcriptional co-repressor. 



(ii) HDACl and RbAp48 are components of the mSin3A complex. 
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Because SIN3 and RPD3 appear to function in the same pathway in yeast and two 
components of the mSinSA complex, p50 and p55, are similar in apparent molecular 
weight to HDACl we hypothesized that HDACl or related proteins might be 
components of the mSinSA repressor complex. To test this hypothesis, the protdns 
5 bound to mSin3A immunecompiexes were eluted with ionic detergents and reprecipitated 
with affinity purified antibodies specific for an internal peptide of HDACl. Only two 
proteins eluted from the mSihS A complex were re-precipitated by HDACl antiserum 
(Figure 1 1 A). These polypeptides comigrated with p50 and p55 from the low stringency 
mSin3A immunoprecipitation, suggesting that proteins highly related to HDACl are 
10 complexed to mSinSA in vivo. p55 comigrates with in vitro translated HDACl and is 
recognized by an antibody specific for a carboxy-terminal epitope unique to HDACl. 
Another CDNA encoding an HDACl homolog, HDAC2, has recently been identified 
(Yang et al., 1996a). It is likely that p50 represents HDAC2 (data not shown). 

In a reciprocal experiment, we performed low stringency immunoprecipitations 
15 using antiserum specific for an epitope at the carboxy-terminus of HDACl. HDACl 
inununoprecip-itates contain several proteins that were specifically competed with the 
immunizing peptide (Figure 1 IB). A polypeptide doublet that comigrated with mSin3A 
was detected in the HDACl inmiunocomplexes (Figure 1 IB and 1 IC). To confirm that 
the doublet coprecipitating with HDACl is mSinSA, the HDACl immunocomplex was 
20 eluted and reprecipitated with antiserum specific for mSin3A (Figure IIC). The two 
proteins in this precipitate comigrated with mSin3A, confirming that mSin3A and 
HDACl are associated in vivo. 

To determine whether HDACl associated with mSin3A in vivo is enzyraatically 
active, we assayed low-stringency immunoprecipitates for histone deacetylase activity. 

25 We used a synthetic peptide corresponding to the first twenty four amino acids of histone 
H4 as a substrate for our deacetylase assay (Taunton et al., 1996b). Low-stringency anti- 
mSinSA immunoprecipitates contained deacetylase activity; however, only background 
levels of deacetylase activity were detected in the immunoprecipitates if the mSinSA anti- 
serum was blocked with cognate immunogen (Figure 1 ID). To confirm the authenticity 

30 of the mSin3A assodated activity we treated the immunoprecipitates with synthetic 
trapoxin, a specific inhibitor of histone deacetylase activity (Taunton et al., 1996a). 
Treatment of mSin3A complexes in vitro with 10 nM trapoxin reduced deacetylation by 
approximately 50% (Figure UD), suggesting that the precipitated deacetylase activity 
can be attributed to trapoxin-sensitive histone deacetylases bound to mSin3 A. 
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We detected an interaction between HDACl and RbAp48 in vivo (Figure IIB, 
1 IC and Example 1). The low-stringency mSinSA immunoprecipitation shown in Figure 
1 IC also contained a protein that comigrated with RbAp48 (marked with an asterisk) that 
was not readily visible on the shorter exposures of low stringency mSin3 
5 immunoprecipitations (Figure 11 A). We have identified RbAp48 in mSin3A 
immunoprecipitates from cell extracts of nontransfected cells by western blotting, further 
demonstrating that mSinS A and RbAp48 associate in vivo (Figure 12A). 

To address fiirther the association between HDACl and RbAp48 with mSinSA, 
we expressed the mammalian proteins in insect cells using recombinant baculoviruses. To 

10 this end, we expressed recombinant FLAG-epitope tagged HDACl (HDACl-F) that 
could be immuno-purified by anti-FLAG antibodies and histidine-tagged mSin3A 
(mSin3A-H) that could be purified by nickel aflMty (data not shown). HDACl-F was 
immunoprecipitated from infected Sf9 cell extracts by anti-FLAG antibodies in the 
presence or absence of mSin3A-H. HDACl-F was also precipitated by Ni^*-NTA 

15 agarose in a manner that was dependent on coexpression of mSin3A-H (Figure i2B), 
demonstrating that a complex between HDAC 1 and mSin3A is formed in insect cells 
using exogenously expressed human proteins. 

Consistent with our finding that RbAp48 is associated with mSinSA and HDACl 
in vivo, we show that baculovarus expressed Flu-epitope tagged RbAp48 (p48-HA) is 

20 specifically precipitated from infected Sf9 cell extracts using anti-FLAG antibody only 
when HDACl-F is coexpressed. Furthermore, p48-HA is specifically retained by NP*- 
NTA in the presence of mSin3A-H (Figure 12C). Co-expression of p48-HA did not 
appear to eflfect the association between HDACl-F and mSin3A-H, suggesting that the 
regions of interaction are distinct and that all three proteins can associate simultaneously. 

25 These data suggest a direct interaction between mSin3A, HDACl and RbAp48 in vivo. 

(iii) Transcription repression by mSin3 A requires histone deacetylase activity. 

To investigate whether histone deacetylation plays a role in mSin3A-mediated 
transcriptional repression in vivo, we examined mSin3A-specific repression in the 
30 presence and absence of the histone deacetylase inhibitor trapoxin. 293 cells were 
transfected with a luciferase reporter gene construct containing a minimal promoter 
consisting of nly a TATA box and initiation site derived from the myelomonocj/tic 
growth factor gene (Figure 13A). This reporter has four consensus binding sites for the 
DNA binding domain of the S. cerevisiae transcriptional activator GAL4 and therefore is 
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responsive to chimeric proteins containing the GAL4 DNA binding domain (GALDBD) 
(Stemeck et al., 1992). We have used this reporter constiuct previously to demonstrate 
that fusion of the SID repressor region of Madl to the GALDBD is necessary for 
mSin3A-dependent transcriptional repression. Furthermore, we have shown that fusion 
5 of SK) to the potent transcriptional activator GALVP16, MadN35GALVP16, can cancel 
the activation function of VPI6 in an mSin3 A-dependent manner. Consistent with our 
previous results (Ayer et al., 1996), MadN35GALVP16 activated transcription from the 
reporter gene approximately 100-fold less well than GALVP16 (data not shown). As a 
negative control we engineered two proline substitutions into the SID of Madl, 

10 Mad(Pro); this protein cannot bind mSin3 A in vitro (Ayer et al,, 1995). Consistent with 
an inability to interact with mSinSA, Mad(Pro)GALVP16 is a much less potent repressor 
(Figure 13B). In control experiments we have shown that the observed effects require 
the presence of GAM sites in the promoter and that both MadN35GALVP16 and 
Mad(Pro)N35GALVP16 are expressed to equivalent levels in these cells and bind GAM 

15 sites with similar afiSnities (data not shown). To test the role of histone deacetylation on 
the repression observed in our transfection assays, we first examined the effect of 
trapoxin on histone deacetylase activity in 293 cells. As expected, in vivo treatment with 
10 nM trapoxin for eight hours reduced deacetylase activity of both crude 293 extracts 
and anti-HDACl immunopurified complexes by approximately 46% and 58%, 

20 respectively (Figure 13C). 

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and 
Mad(Pro)N35GALVPI6 mediated repression, we treated a duplicate set of transfections 
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment 
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16 nine- 

25 fold while it had little effect on the activity of Mad(Pro)N35GALVIfl6, suggesting that 
the histone deacetylation plays a direct role in mSin3 A transcriptional repression (Figure 
13B). In addition, there was typically less than a two-fold effect of trapoxin on the 
activity of the reporter cene in cells transfected with the expression vector alone or in 
cells transfected with GALVP16 (data not shown). Following trapoxin treatment, the 

30 repression observed for MadN35GALVP16 was still seven times greater than that of 
Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following 
trapoxin treatment (Figure 13B) continues to drive mSin3A»mediatcd repression; 
however, we can not rule out that mSin3A is capable of repression by mechanisms 
independent of histone deacetylation. 



35 
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Discussion 

Earlier studies implicated mSin3 as the primary candidate for the negative 
transcriptional function of the DNA binding transcription factor Mad (Ayer et al,, 1995; 
Schreiber-Agus et al. 1995; Ayer et al., 1996). We present evidence that the mSinSA 
5 compressor is part of a high molecular weight, multicomponent complex(es) that contains 
active histone deacetylase, thereby implicating histone deacetylation as a potential 
mechanism for mSin3A-mediated repression. Furthermore, we observe a pronounced 
increase in the transcriptional activity of an mSinSA-silenced reporter gene upon 
treatment in vivo with the specific histone deacetylase inhibitor trapoxin, suggesting that 
10 fiill transcriptional repression by mSinSA requires histone deacetylation. These results 
suggest a mechanism of gene regulation through the targeting of an en/yme that alters 
chromatin structure. 

These observations are consistent with genetic experiments in yeast, suggesting 
that the yeast orthologs of mSinSA and HDACl. SIN3 and RPD3 respectively, are 
15 epistatic transcriptional regulators (Stillman et al., 1994). Furthermore, recent 
biochemical evidence demonstrates that PpdSp is a component of a large molecular 
weight histone deacetylase complex in yeast (Rundlett et al., 1996). Together with our 
results, these findings predict a conservation of the mSin3/HDACl fiinctional association 
in yeast. 

20 We have used chimeric transcriptional regulators to discern the eflfects of trapoxin 

on the activity of our reporter genes. The MadN35GALVP16 diimera fiinctioned as a 
repressor by a mechanism that was dependent on the binding of mSin3A and that was 
sensitive to trapoxin. The same mutations that inactivate MadN35GALVP16 as a 
transcriptional repressor (i e. Mad(Pro)N35GALVP16), also block interaction between 

25 Madl and mSin3A in vitro and Madl fijnction in vivo. Therefore, it is likely that 
Mad;Max heterocomplexes repress transcription in a maimer dependent on an mSinSA- 
associated histone deacetylase. 

By co-immunoprecipitation we have demonstrated that mSin3A and HDACl 
associate in vivo. Consistent with these data we observed nuclear colocalization of 
30 mSin3A and HDACl by immunofluorescence microscopy (data not shown). Finally, 
overexpression in insect cells facilitates co-purification of mSin3A and HDACl (Figure 
12B and C), suggesting that the interaction between mSin3A and HDACl is cither direct 
or requires a conserved cofactor. The finding that mSinSA has diflFerent associated 
histone deacetylases (HDACl and HDAC2) suggests that the mSin3A complex(es) may 
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have multiple substrate or target specificities. The heterogeneous nature of the mSin3A 
complex potentially reflects a diverse array of repressors, histone deacetylases and 
different targeting molecules that facilitate mSinSA-dependent alterations in gene 
expression. 

5 At least five additional polypeptides are stably associated with mSin3A (Figures 

10 and 11); whose function is currently unknown but tight association with mSin3A in 
both U937 cells and Jurkat T cells (data not shown) suggests that they in some way 
mediate mSin3A fianction. Furthermore, we have identified an association between 
mSinSA and RbAp48 in vivo, suggesting that this protein may play a role in regulating 

10 mSin3A-targeted deacetylation. RbAp48 was originally identified as a retinoblastoma 
binding protein that contains WD repeats and shares homology with the -subunit of G- 
proteins (Qian et al,, 1993). Subsequently, it has been shown that RbAp48 or its 
orthologs are involved in targeting different histone modifying enzymes to chromatin 
(Parthun et al., 1996; Taunton et al., 1996b; Tyler et al., 1996; Verreault et al., 1996). 

15 The mSin3A/RbAp48 complex isolated firom U937 cells (Figure 1 1) is likely to represent 
only a small firaction of the mSin3 A complexes, but its detection implies that mSin3A may 
play a role in the control of different aspects of chromatin physiology as well as 
transcription repression. 

It is unclear how different chromatin states facilitate transcription repression and 

20 activation or how their distinct biochemical states arise; however, there is ample 
cytological, genetic and biochemical evidence supporting the model that hyperacetylated 
chromatin is transcriptionally more active than hypoacetylated chromatin, Acetylation 
levels in -heterochromatin of Drosophila meUmogaster polytene chromosomes are 
significantly reduced at lysine positions 5, 8, and 16 of histone H4, while the 

25 transcriptionally hyperactive X-chromosome of male flies is uniquely hyperacetylated at 
position 16 (Turner et al., 1992). In yeast, mutation of acetyl-accepting lysines in histone 
H4 reduces the activity of the GAU, PHOS and CUPI promoters in vivo (Durrin et ai,, 
1991). The transcriptionally silent regions in yeast, HML and HMR, are hypoacetylated 
and their activation is correlated with acetylation of histone H4 (Braunstein et al„ 1993). 

30 Additionally, biochemical studies showed that certain transcription factors have higher 
affinity for their binding sites when those sites are embedded in chromatin assembled fi'om 
hyperacetylated histones (Lee et al., 1993; Vettes-Dadey al., 1996). Finally, evidence 
suggesting that acetylation is required for activation comes fi'om the recent demonstration 
that several coactivators either encode acetyltransferases or are associated with 

35 acetyltransferases. Thus, our data support this general model for the control of gene 
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expression by histone acetylation status and provide a biochemical mechanism for 
deacetylation-mediated repression. 

The acetylalion status of a particular chromatin region represents a balance 
between competing, acetylation and deacetylation reactions. We propose that 
5 MadN35GALVPI6 recruits mSin3A-HDAC complexes to specific sites on DNA and 
shifts this equilibrium towards deacetylation and subsequent transcription repression by 
creating a high effective molarity of the histone deacetylase. In yeast, the activation 
domain of VP 16 has been shown to use the acetyltransferase GenSp as a coacisvator 
(Marcus et al., 1994; Brownell et al., 1996), suggesting that in mammalian cells VP 16 
10 will also use an acetyltransferase as a cofactor. Thus, trapoxin treatment could shift the 
equilibrium from deacetylation to acetylation and thereby drive activation. 

Whether histone deacetylation will always have a negative eflFect on gene 
expression is unclear. Mutants in SINS and RPD3 can have both positive and negative 
effects on gene expression (Vidal and Gaber, 1991; Yoshiraoto et al., 1992); however, 

15 for SIN3 there is evidence that positive effects may be indirect (Wang et al., 1994). In 
addition, mutations or deletions in RPD3 have recently been shown to enhance telomeric 
silencing both in yeast and in fruit fly (Sussel et al., 1995; De Rubertis et al., 1996; 
Rundlert et al., 1996). In mammalian cells, deacetylase inhibitors can inhibit MyoD- 
(Johnston et al., 1992) and steroid receptor-activated transcription (McKnight et al., 

20 1990; Bresnick et al., 1990). While it remains to be shown that the effects of RPD3 on 
silencing are direct, this evidence suggests that histone deacetylation can elicit both 
positive and negative effects on gene expression. Determining the factors that govern the 
functional outcome of histone deacetylation will provide fertile ground for further 
experimentation. 

25 

Experimental Procedures 

Antibodies, cell culture, and Immunoprecipitations: To generate antiserum 
spedfic for mSin3A a GST fusion protein encoding, amino acids 251 through 405 of 
mSin3A was used to immunize a New Zealand WMte rabbit. The crude serum was 
30 passed over a GST column to remove the anti-GST antibodies. U937 cells were grown 
in RPMI supplemented with 10% calf serum (Hyclone), glutamine and penicillin- 
streptomycin. Low and high stringency immunoprecipitations were performed essentially 
as described (Ayer and Eisenman, 1993). To elute proteins from low stringency 
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immunoprecipitates, they were incubated for 60 minutes at room temperature in antibody 
buffer and reprecipitated under high stringency conditions. 

Luciferase assays: 293 cells were seeded in triplicate onto 60 mm dishes at 3x10^ 
cells in 4 ml DME with 10% calf serum (Hyclone). Six hours after seeding, cells were 
5 transfected with 50 ng luciferase reporter, 50 ng CMV-p-gal, 50 ng expression construct, 
and 2.85 ^g carrier DNA using the BBS/CaPO^ method. 10 nM trapoxin was added to 
the media 8 hours prior to the luciferase assays. Cell lysates were preparcd^-24 hours 
following transfections, and luciferase and p-galactosidase activities was assayed 
according to manufacturer directions (Promega, Tropix). Luciferase values (relative light 
10 units) were normalized for transfection efficiency by dividing by P-gal activity. 

Histone deacetylase assays: In vitro histone deacetylase activity was assayed 
essentially as described with either 50 ^1 of crude cell extract (approidmately 5 X 10* 
Cells) or inununopurified cell extracts (approximately 2 X 10^ cells) for 2.5 hrs at 37 C 
(Taunton et al., 1996b). Pretreatment of crude or immunopurified extract with synthetic 

15 trapoxin was performed for 30 minutes at 4 C prior to addition of peptide substrate. TAg 
Jurkat and 293 cell extracts for histone deacetylase assays were prepared as in Taunton et 
al., 1996. Anti-HDACl and anti-mSin3A inununoprecipitations were performed as 
described above and in Figure 1 1 . The Protein-A conjugated immunoprecipitates were 
washed three times in J-bufFer plus I mM EDTA and resuspended in J-buffer without 

20 Triton-X-100, and histone deacetylase activity was measured as described. 

Baculoviruses: cDNAs encoding Flag-tagged HDACl, HA-tagged RbAp48 and 
His-tagged mSin3A were cloned into the transfer vector pVL 1392 (specific details on 
the construction of these vectors is available upon request). Recombinant virus was 
generated using Baculogold DNA according to the manufactures instructions 
25 (Pharminigen). Sf9 or Mgh 5 cells were infected at high multiplicity, extracts prepared 
48 hours post infection and inununoprecipitations performed as described above. Ni^*- 
NTA agarose and anti-Flag antibody were purchased fi-om Qiagen and Kodak-IBr, 
respectively. 
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10 



Example 4 

Since the priority date of this application, a number of other mammalian HDx 
genes have been descHBSi in the literature. In particular, a mouse HDl clone is 
identified in GenBank as accession number U807080. Another HDx member, HD2 
(HDAC-2) is also described for both human and mouse; see for example, 
entries U31814 and U31758. Without exception, each clone includes /a^ v motiflfj 
represented in the general formula of SEQ ID No. 12, an^Pa x motif r^resent 
general formula SEQ ID No. 14. 



All of the above-cited references and publications are hereby incorporated by 
15 reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, numerous equivalents to the specific polypeptides, nucleic adds, 
methods, assays and reagents described herein. Such equivalents are considered to be 
20 within the scope of this invention and are covered by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Schreiber, Stuart L. 

Taunton, Jack 
Haosig, Christian A. 
Jamison, Timothy F. 

10 

"""("iiT' TITLE OF INVENTION: Histdne Deacetylasae and Uses Related 
Thereto 



(iii) NUMBER OF SEQUENCES: 15 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FOLEY, HOAG ELIOT, LLP 

(B) STREET: One Post Office Square 

(C) CITY: Boston 
20 (D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCI I (text) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 26-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Vincent, Matthew P. 

(B) REGISTRATION NUMBER: 36,709 

(C) REFERENCE/DOCKET NUMBER: HUV019.25 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 832-1000 

(B) TELEFAX: (617) 832-7000 



45 <2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 
50 (C) STPJiNDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,1446 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GCG GAG ACG CAG GGC ACC CGG AGG AAA GTC TGT TAC TAG TAG GAC 48 
10 Met Ala Gin Thr Gln Cly Thr Arg Arg Lyo Val Cys Tyr Tyr Tyr Asp 
1 S 10 15 

GGG GAT GTT GGA AAT TAC TAT TAT GCA CAA GGC CAC CCA ATG AAG CCT 96 
Gly Asp Val Gly Aan Tyr Tyr Tyr Gly Gin Gly Hie Pro Met Lya Pro 
15 20 25 30 

CAC CGA ATC CGC ATG ACT CAT AAT TTG CTG CTC AAC TAT GGT CTC TAC 144 
His Arg lie Arg Met Thr His Aan Leu Leu Leu Asn Tyr Gly Leu Tyr 
35 40 45 

20 

CGA AAA ATG GAA ATC TAT CGC CCT CAC AAA GCC AAT GCT GAG GAG ATG 192 
Arg Lye Met Glu He Tyr Arg Pro Hie Lys Ala Aan Ala Glu Glu Met 
50 55 60 

25 ACC AAG TAC CAC AGC GAT GAC TAC ATT AAA TTC TTG CGC TCC ATC CGT 240 
Thr Lys Tyr His Ser Asp Asp Tyr He Lya Phe Leu Arg Ser He Arg 
65 70 75 80 

CCA GAT AAC ATG TC6 GAG TAC AGC AAG CAG ATG CAG AGA TTC AAC GTT 288 
30 Pro Asp Asn Met Ser Glu Tyr Ser Lys Gin Met Gin Arg Phe Asn Val 

85 90 95 

GGT GAG GAC TGT CCA GTA TTC GAT GGC CTG TTT GAG TTC TGT CAG TTG 336 
Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gin Leu 
35 100 105 110 

TCT ACT GGT GGT TCT GTG GCA ACT GCT GTG AAA CTT AAT AAG CAG CAG 384 

Ser Thr Gly Gly Ser Val Ala Ser Ala Val Lys Leu Asn Lys Gin Gin 
115 120 125 

40 

ACG GAC ATC GCT GTG AAT TGG GCT GGG GGG CTG CAC CAT GCA AAG AAG 432 

Thr Asp He Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys 

130 135 140 

45 TCC GAG GCA TCT GGC TTC TGT TAC GTC AAT GAT ATC GTC TTG GCC ATC 480 
Ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp He Val Leu Ala He 
145 150 155 160 

CTG GAA CTG CTA AAG TAT CAC CAG AGG GTG CTG TAC ATT GAC ATT GAT 528 
50 Leu Glu Leu Leu Lys Tyr His Gin Arg Val Leu Tyr He Asp He Asp 

165 170 175 



ATT CAC CAT GGT GAC GGC GTG GAA GAG GCC TTC TAC ACC ACG GAC CGG 
He His Hie Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp Arg 
55 180 185 190 



576 
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GTC ATG ACT GTG TCC TTT CAT AAG TAT GGA GAG TAG TTC CCA GGA ACT 624 

Val Met Thr Val Ser Phe His Lys Tyr Gly Glu Tyr Phe Pro Gly Thr 
195 200 205 

5 

GGG GAC OTA CGG GAT ATC GGG GCT GGC AAA GGC AAG TAT TAT GCT GTT 672 

Gly Asp Leu Arg Asp lie Gly Ala Gly Lys Gly Lys Tyr Tyr Ala Val 

210 215 220 

10-._AAC TAG CCG CTC OGA GAC GGG ATT GAT GAC GAG TCC TAT GAG GCC ATT 720 
Asn Tyr Pro Leu Arg Asp Gly He Asp Asp Glu Ser Tyr Glu Ala He 
225 230 235 240 

TTC AAG CCG GTC ATG TCC AAA GTA ATG GAG ATG TTC CAG CCT AGT GCG 768 
15 Phe Lys Pro Val Met Ser Lys Val Met Glu Met Phe Gin Pro Ser Ala 

245 250 255 

GTG GTC TTA CAG TGT GGC TCA GAC TCC CTA TCT GGG GAT CGG TTA GOT 816 
Val Val Leu Gin Cys Gly Ser Asp Ser Leu Ser Gly Asp Arg Leu Gly 
20 260 265 270 

TGC TTC AAT CTA ACT ATC AAA GGA CAC GCC AAG TGT GTG GAA TTT GTC B64 
Cys Phe Asn Leu Thr He Lys Gly His Ala Lys Cys Val Glu Phe Val 
275 280 285 



25 



35 



45 



50 



55 



AAG AGO TTT AAC CTG CCT ATG CTG ATG CTG GGA GGC CGT GGT TAC ACC 912 
Lys Ser Phe Asn Leu Pro Met I^u Met Leu Gly Gly Gly Gly Tyr Thr 
290 295 300 



30 ATT CGT AAC GTT GCC CGG TGC TGG ACA TAT GAG ACA GCT GTG GCC CTG 
He Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu 
305 310 315 320 



960 



GAT ACG GAG ATC CCT AAT GAG CTT CCA TAC AAT GAC TAC TTT GAA TAC 1008 
Asp Thr Glu He Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr 
325 330 335 



TTT GGA CCA GAT TTC AAG CTC CAC ATC AGT CCT TCC AAT ATG ACT AAC 1056 
Phe Gly Pro Asp Phe Lys Leu His He Ser Pro Ser Asn Met Thr Asn 
40 340 345 350 



CAG AAC ACG AAT GAG TAC CTG GAG AAG ATC AAA CAG CGA CTG TTT GAG 1104 
Gin Asn Thr Asn Glu Tyr Leu Glu Lys He Lys Gin Arg Leu Phe Glu 
355 360 365 

AAC CTT AGA ATG CTC CCG CAC GCA CCT GGG GTC CAA ATG CAG CCG ATT 1152 
Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gin Met Gin Ala He 
370 375 380 

CCT GAG GAC GCC ATC CCT GAG GAG AGT GGC GAT GAG CAC GAA GAC GAC 1200 
Pro Glu Asp Ala He Pro Glu Glu Ser Gly Asp Glu Asp Glu Asp Asp 
385 390 395 400 

CCT GAC AAG CGC ATC TCG ATC TGC TCC TCT GAC AAA CGA ATT GCC TGT 1248 
Pro Asp Lys Arg He Ser He Cys S r Ser Asp Lys Arg He Ala Cys 
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GAG GAA GAG TTC TCC GAT TCT GAA 
Glu Glu Glu Phe Ser Asp Ser Glu 
5 420 
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410 415 

GAG GAG GGA GAG GGG GGC CGC AAG 1296 
Glu Glu Gly Glu Gly Gly Arg Lys 
425 430 



AAC TCT TCC AAC TTC AAA AAA GCC AAG AGA GTC AAA ACA GAG GAT GAA 1344 
Aen Ser Ser Asn Phe Lys Lya Ala Lys Arg Val Lys Thr Glu Asp Glu 
435 440 445 

10 

AAA GAG AAA GAC CCA GAG GAG AAG AAA GAA GTC ACC GAA GAG GAG AAA 1392 
Lya Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys 
450 455 460 



ACC AAG GAG GAG AAG CCA GAA GCC AAA GGG GTC AAG GAG GAG GTC AAG 1440 
Thr Lye Glu Glu Lys Pro Glu Ala Lys Gly Val Lye Glu Glu Val Lys 
465 470 475 480 

TTG GCC TGA 1449 
Leu Ala 



(2) INFORMATION FOR SEQ ID NO: 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATTGACTTCC TGCAGAGAGT CAGCCCCACC AATATGCAAG GCTTCACCAA GAGTCTTAAT 60 

GCCTTCAACG TAGGCGATGA CTGCCCAGTG TTTCCCGGGC TCTTTGAGTT CTGCTCGCGT 120 

TACACAGGCG CATCTCTGCA AGGAGCAACC CAGCTGAACA ACAAGATCTG TGATATTGCC 180 

ATTAACTGGG CTGGTGGTCT GCACCATGCC TAGAAGTTTG AGGCCTCTGG CTTCTGCTAT 240 

GTCAACGACA TTGTGTTTGG CATCCTGGAG CTGCTCAAGT ACCACCCTCG GGTGCTCTAC 300 

ATTGACATTG ACATCCACCA TGGTGACGGG GTTCAAGAAG CTTTCTACCT CACTGACCGG 360 

GTCATGACGG TGTCCTTTC 379 

(2} INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

10 

TACTACTGTC TGAACGTGCC CCTGCGGATG GGCATTGATG ACCAGAGTTA CAAGCACCTT 60 
TTCCAGCCGG TTATCAACCA GGTAGTGGAC TTCTACCAAC CCACGTGCAT TGTGCrCCAG 120 
15 TGTGGAGCTG ACTCTCTGGG CTGTGATCGA TTGGGCTGCT TTAACCTCAG CATCCGAGGG 180 
CATGGGGAAT GCGTTGAATA TGTCAACAGC TTCAATATCC CTCTACTCGT GCTGGGTGGT 240 
GGTGGTTATA CTGTCCGAAA TGTTGCCCGC TGCTGGACAT ATGAGACATC GCTGCTGGTA 300 

20 

GAAGAGGCCA TTAGTGAGGA GCTTCCCTAT AGTGAATACT TCGAGTACTT TGCCCCAGAC 360 
TTCACACTTC ATCCA 375 
25 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 ba0e pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGTCATGCTA AATGTGTAGA AGTTGTAAAA ACTTTTAACT TACCATTACT GATGCTTGGA 60 

GGAGGTGGCT ACACAATCCG TAATGTTGCT CGATGTTGGA CATATGAGAC TGCAGTTGCC 120 

CTTGATTGTG AGATTCCCAA TGAGTTGCCA TATAATGATT ACTTTGAGTA TTTTGGACCA 180 

45 GACTTCAAAC TGCATATTAG TCCTTCAAAC ATGACAAACC AGAACAC 227 

(2) INFORMATION FOR SEQ ID NO: 5: 

50 (1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 482 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Gin Thr Gin Gly Thr Arg Arg Lys Val Cya Tyr Tyr Tyr Asp 
5 1 5 10 15 

Gly Asp Val Gly Asn Tyr Tyr Tyr Gly Gin Gly His Pro Met Lye Pro 
20 25 30 

10 Hie Arg lie Arg Met Thr His Aan Leu Leu Leu Asn Tyr Gly Leu Tyr 
35 40 45 

Arg Lye Met Glu lie Tyr Arg Pro His Lys Ala Asn Ala Glu Glu Met 
50 55 60 

15 

Thr Lys Tyr His Ser Asp Asp Tyr lie Lys Phe Leu Arg Ser lie Arg 
65 70 75 80 

Pro Asp Asn Met Ser Glu Tyr Ser Lys Gin Met Gin Arg Phe Asn Val 
20 85 90 95 

Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cye Gin Leu 
100 105 110 

25 Ser Thr Gly Gly Ser Val Ala Ser Ala Val Lye Leu Asn Lys Gin Gin 
115 120 125 

Thr Asp lie Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys 
130 135 140 

30 

Ser Glu Ala Ser Gly Phe Cye Tyr Val Asn Asp lie Val Leu Ala lie 
145 150 155 160 

Leu Glu Leu Leu Lys Tyr His Gin Arg Val Leu Tyr He Asp He Asp 
35 165 170 175 

lie His His Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp Arg 
180 185 190 

40 Val Met Thr Val Ser Phe Hie Lys Tyr Gly Glu Tyr Phe Pro Gly Thr 
195 200 205 

Gly Asp Leu Arg Asp He Gly Ala Gly Lys Gly Lys Tyr Tyr Ala Val 
210 215 220 

45 

Aon Tyr Pro Leu Arg Aap Gly He Asp Aop Glu Ser Tyr Glu Ala He 
225 230 235 240 

Phe Lys Pro Val Met Ser Lye Val Met Glu Met Phe Gin Pro Ser Ala 
50 245 250 255 

Val Val Leu Gin Cys Gly Ser Aap Ser Leu Ser Gly Asp Arg Leu Gly 
260 265 270 



55 



Cys Phe Aan Leu Thr He Lye Gly His Ala Lys Cys Val Glu Phe Val 
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275 280 285 

Lys Ser Phe Asn Leu Pro Met Leu Met Leu Gly Gly Gly Gly Tyr Thr 
290 295 300 

5 

lie Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu 
305 310 315 320 

Asp Thr Glu He Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr 
10 325 330 . 335 

Phe Gly Pro Asp Phe Lys Leu Hie He Ser Pro Ser Asn Met Thr Asn 
340 345 350 

15 Gin Asn Thr Asn Glu Tyr Leu Glu Lys He Lys Gin Arg Leu Phe Glu 
355 360 365 

Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gin Met Gin Ala He 
370 375 380 

20 

Pro Glu Asp Ala He Pro Glu Glu Ser Gly Asp Glu Asp Glu Aep Asp 
385 390 395 400 

Pro Asp Lys Arg He Ser He Cys Ser Ser Aep Lys Arg He Ala Cys 
25 405 410 415 

Glu Glu Glu Phe Ser Asp Ser Glu Glu Glu Gly Glu Gly Gly Arg Lys 
420 425 430 

30 Asn ser Ser Asn Phe Lys Lys Ala Lys Arg Val Lys Thr Glu Asp Glu 
435 440 445 

Lys Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys 
450 455 460 

35 

Thr Lys Glu Glu Lys Pro Glu Ala Lys Gly Val Lys Glu Glu Val Lys 
465 470 475 480 

Leu Ala 

40 

<2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

50 (v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Ile Asp Phe Leu Gin Arg Val Ser Pro Thr Asn Met Gin Gly Phe Thr 
15 10 15 

Lye Ser Leu Asn Ala Phe Asn Val Gly Asp Asp Cys Pro Val Phe Pro 
5 20 25 30 

Gly Leu Phe Glu Phe Cys Ser Arg Tyr Thr Gly Ala Ser Leu Gin Gly 
35 40 45 

10 Ala Thr Gin Leu Asn Asn Lya He Cys Asp He Ala He Asn Trp Ala 

50 55 60 



15 



Gly Gly Leu His His Ala Lys Lys Phe Glu Ala Ser Gly Phe Cys Tyr 
65 70 75 80 

Val Asn Asp He Val Phe Gly He Leu Glu Leu Leu Lya Tyr His Pro 
85 90 95 



Arg Val Leu Tyr He Asp He Asp He His His Gly Asp Gly Val Gin 
20 100 105 110 

Glu Ala Phe Tyr Leu Thr Asp Arg Val Met Thr Val Ser Phe Pro Gin 
115 120 125 

25 He Arg Glu He Tyr 

130 

(2) INFORMATION FOR SEQ ID NO: 7: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

40 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Tyr Tyr Cys Leu Asn Val Pro Leu Arg Met Gly He h&p Asp Gin Ser 
15 10 15 

45 

Tyr Lys His Leu Phe Gin Pro Val He Asn Gin Val Val Asp Phe Tyr 
20 25 30 

Gin Pro Thr Cys He Val Leu Gin Cys Gly Ala Asp Ser Leu Gly Cys 
50 35 40 45 

Asp Arg Leu Gly Cys Phe Asn Leu Ser He Arg Gly Hia Gly Glu Cys 
50 55 60 



55 



Val Glu Tyr Val Lye Ser Phe Asn He Pro Leu Leu Val Leu Gly Gly 
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65 70 75 80 

Gly Gly Tyr Thr Val Arg Aan Val Ala Arg Cys Trp Thr Tyr Glu Thr 
85 90 95 

5 

Ser Leu Leu Val Glu Glu Ala lie Set Glu Glu Leu Pro Tyr Ser Glu 
100 105 110 

Tyr Phe Glu Tyr Phe Ala Pro Asp Phe Thr Leu His Pro 
10 115 120 . 125 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



35 



(ii) MOLECULE TYPE: peptide 
(V) FRAGMENT TYPE: internal 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Aen Leu Leu Val Leu Gly His Ala Lye Cys Val Glu Val Val Lys Thr 
15 10 15 

30 Phe Asn Leu Pro Leu Leu Met Leu Gly Gly Gly Gly Tyr Thr He Arg 

20 25 30 



Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu Asp Cys 
35 40 45 

Glu He Pro Aan Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr Phe Gly 
50 55 60 



Pro Asp Phe Lys Leu His He Ser Pro Ser Asn Met Thr Asn Gin Asn 
40 65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 11: 

45 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1275 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
55 (A) NAME/KEY: CDS 
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(B) LOCATION: 1. .1275 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG GCC GAC AAG GAA GCA GCC TTC GAC GAC GCA GTG GAA GAA CGA GTG 48 
Met Ala Asp Lys Olu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val 
15 10 15 

ATC AAC GAG GAA TAC AAA ATA TGG AAA AAG AAC ACC OCT TTT CTT TAT 96 
lie Asn Glu Glu Tyr Lye He Trp Lys Lys Asn Thr Pro Phe I^eu'^Tyx:" 
20 25 30 

GAT TTG GTG ATG ACC CAT GCT CTG GAG TGG CCC AGC CTA ACT GCC CAG 144 
Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gin 
35 40 45 

TGG CTT CCA GAT GTA ACC AGA CCA GAA GGG AAA GAT TTC AGC ATT CAT 192 
Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser He His 
50 55 60 

CGA CTT GTC CTG GGG ACA CAC ACA TCG GAT GAA CAA AAC CAT CTT GTT 240 
Arg I*eu Val Leu Gly Thr His Thr Ser Asp Glu Gin Asn Hie Leu Val 
65 70 75 80 

ATA GCC AGT GTG CAG CTC CCT AAT GAT GAT GCT CAG TTT GAT GCG TCA 288 
He Ala Ser Val Gin Leu Pro Asn Asp Asp Ala Gin Phe Asp Ala Ser 
85 90 95 

CAC TAC GAC AGT GAG AAA GGA GAA TTT GGA GGT TTT GGT TCA GTT AGT 336 
Hie Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser 
100 105 110 

GGA AAA ATT GAA ATA GAA ATC AAG ATC AAC CAT GAA GGA GAA GTA AAC 384 
Gly Lys He Glu He Glu He Lys He Asn His Glu Gly Glu Val Asn 
115 120 125 

AGG GCC CGT TAT ATG CCC CAG AAC CCT TGT ATC ATC GCA ACA AAG ACT 432 
Arg Ala Arg Tyr Met Pro Gin Asn Pro Cys He He Ala Thr Lys Thr 
130 135 140 

CCT TCC AGT GAT GTT CTT GTC TTT GAC TAT ACA AAA CAT CCT TCT AAA 480 
Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys 
145 150 155 160 

CCA GAT CCT TCT GGA GAG TGC AAC CCA GAC TTG CGT CTC CGT GGA CAT 528 
Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly Hie 
165 170 175 

I CAG AAG GAA GGC TAT GGG CTT TCT TGG AAC CCA AAT CTC AGT GGG CAC 576 
Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His 
180 185 190 

TTA CTT AGT GCT TCA GAT GAC CAT ACC ATC TGC CTG TGG GAC ATC AGT 624 
i hBxx L u Ser Ala Ser Asp Asp His Thr He Cys Leu Trp Asp He Ser 
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40 



50 
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195 200 205 



GCC GTT CCA AAG GAG GGA AAA GTG GTA GAT GCG AAG ACC ATC TTT ACA 672 
Ala Val Pro Lye Glu Gly Lys Val Val Asp Ala Lye Thr lie Phe Thr 
5 210 215 220 



GGG CAT ACG GCA GTA GTA GAA GAT GTT TCC TGG CAT CTA CTC CAT GAG 720 
Gly HiB Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu 
225 230 235 240 

TCT CTG TTT GGG TCA GTT GCT GAT GAT CAG AAA- GTT ATG ATT TGG GAT 768 
Ser Leu Phe Gly Ser Val Ala Aap Asp Gin Lys Leu Met lie Trp Asp 
245 250 255 

15 ACT CGT TCA AAC AAT ACT TCC AAA CCA AGC CAC TCA GTT GAT GCT CAC 816 
Thr Arg Ser Aan Aon Thr Ser Lys Pro Ser His Ser Val Aep Ala His 
260 265 270 

ACT GCT GAA GTG AAC TGC CTT TCT TTC AAT CCT TAT ACT GAG TTC ATT 664 
20 Thr Ala Glu Val Aan Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe lie 
275 280 285 

CTT GCC ACA GGA TCA GCT GAC AAG ACT GTT GCC TTG TGG GAT CTG AGA 912 
Leu Ala Thr Gly Ser Ala Aap Lye Thr Val Ala Leu Trp Aap Leu Arg 
25 290 295 300 

AAT CTG AAA CTT AAG TTG CAT TCC TTT GAG TCA CAT AAG GAT GAA ATA 960 
Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lye Asp Glu He 
305 310 315 320 

30 

TTC CAG GTT CAG TGG TCA CCT CAC AAT GAG ACT ATT TTA GCT TCC AGT 1008 
Phe Gin Val Gin Trp Ser Pro Hie Asn Glu Thr lie Leu Ala Ser Ser 
325 330 335 

35 GGT ACT GAT CGC AGA CTG AAT GTC TGG GAT TTA AGT AAA ATT GGA GAG 1056 
Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys He Gly Glu 
340 345 350 



GAA CAA TCC CCA GAA GAT GCA GAA GAC GGG CCA CCA GAG TTG TTG TTT 1104 
Glu Gin Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe 
355 360 365 

ATT CAT GGT GGT CAT ACT GCC AAG ATA TCT GAT TTC TCC TGG AAT CCC 1152 
He His Gly Gly His Thr Ala Lys He Ser Asp Phe Ser Trp Asn Pro 
45 370 375 380 

AAT GAA CCT TGG GTG ATT TGT TCT GTA TCA GAA GAC AAT ATC ATG CAA 1200 
Aen Glu Pro Trp Val He Cys Ser Val Ser Glu Asp Asn He Met Gin 
385 390 395 400 

GTG TGG CAA ATG GCA GAG AAC ATT TAT AAT GAT GAA GAC CCT GAA GGA 1248 
Val Trp Gin Met Ala Glu Asn He Tyr Aan Asp Glu Asp Pro Glu Gly 
405 410 415 

55 AGC GTG GAT CCA GAA GGA CAA GGG TCC TAG 1278 
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Ser Val Asp Pro Glu Gly Gin Gly Ser 
420 425 

(2) INFORMATION FOR SEQ ID NO: 12: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



10 



15 



(ii> MOLECULE TYPE: peptide 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



20 



Asp Xaa Xaa Xaa Asn Xaa Xaa Gly Gly Leu His His Ala Lya Lye Xaa 
15 10 15 

Glu Ala Ser Gly Phe Cys Tyr Xaa Asn Asp lie Val Xaa Xaa lie Xaa 
20 25 30 

Glu Leu Leu Xaa Tyr His Xaa Arg Val Xaa Tyr lie Asp Xaa Asp Xaa 
25 35 40 45 

His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val 
50 55 60 

30 Met Thr Xaa Ser Phe 

65 

(2) INFORMATION FOR SEQ ID NO: 13: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE; internal 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp lie Ala Xaa Asn Trp Ala Gly Gly Leu His His Ala Lys Lya Xaa 
15 10 15 

50 Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp He Val Xaa Xaa He Leu 

20 25 30 



55 



Glu Leu Leu Lys Tyr His Xaa Arg Val Leu Tyr He Asp He Asp He 
35 40 45 
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Hia Hie Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val 
50 55 60 

Met Thr Val Ser Phe 
5 65 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 33 amino acids 

(B) TYPE: amino acicl'" , * 
(D) TOPOLCMsyi linear 



15 



20 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Cye Val Xaa Xaa Xaa Lys Xaa Phe Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly 
15 10 15 

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Xaa Trp Xaa Xaa Glu 
25 20 25 30 

Thr 



30 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 



Cys Val Glu Xaa Val Lye Xaa Phe Asn Xaa Pro Leu Leu Xaa Leu Gly 
45 1 5 10 15 

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu 
20 25 30 

50 Thr 
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What is claimed is: 

1 . An isolated or recombinant HDx polypeptide. 

2. The polypeptide of claim 1, of mammalian origin. 

3. The polypeptide of claim 3, of human origin. 

5 4. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide 
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof 

5. The polypeptide of claim I, which polypeptide comprises an HDx polypeptide 
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof 

6. The polypeptide of claim 1, which polypeptide .comprises an HDx polypeptide 
10 sequence designated in SEQ ID No: 2. 

7. The polypeptide of claim I, which polypeptide is encoded by a nucleic acid having a 
coding sequence, or portion thereof, which hybridizes under stringent conditions to 
the nucleic acid designated in SEQ ID No. 1 . 

9. The polypeptide of claim 1^ which polypeptide is an acetylase activity. 

15 10. The polypeptide of claim 1, which polypeptide binds to an RbAp48 protein. 

1 1. The polypeptide of claim 1, which polypeptide is a fusion protein. 

12. The polypeptide of claim 1, which polypeptide has a molecular weight in the range 
of 45-70 Kd. 

13. An isolated or recombinant polypeptide comprising zxiHDx polypeptide sequence 
20 homologous or identical SEQ ID No, 2, or a fragment thereof which retains one or 

more of (i) a histone deacetylase activity, C") a histone binding activity and Oii) ^ 
RbAp48 binding activity. 

14. The polypeptide of claim 13, which polypeptide comprises an HDx sequence 
represented in the general formula 

25 DXXhWGGLHHAKKXEASGFCYXNDIVXXI- 

XELLXYHXRVXYIDXDXHHGDGXEAFY5aT)RVMT^ 

15. The polypeptide of claim 13, which polypeptide comprises an HDx sequence 
represented in the general formula 
CVXXXKXFXXPXXXXGGGGYTXRhTVARX-WXXET. 

30 16. The polypeptide of claim 13, which polypeptide deacetylates acetylated histones. 
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17. The polypeptide of claim 13, which polypeptide is a dominant negative inhibitor 
which antagonizes deacetylation of acetylated histones. 

18. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at 
least 88 percent homologous with SEQ ID No: 2, or fragment thereof 

5 19. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at 
least 95 percent homologous with SEQ ID No: 2, or fragment thereof 

20. The polypeptide of claim 13, which polypeptide includes at least 25 amino acid 
residues of an HDx polypeptide sequence. 

21. The polypeptide of claim 13, wherein said polypeptide modulates cellular 
10 proliferation. 

22. An isolated or recombinant polypeptide comprising an HDx polypeptide sequence 
represented SEQ ID No. 2, or a fragment thereof which retains one or more of (i) a 
histone deacetylase activity, (ii) a histone binding activity and (iii) an RbAp48 
binding activity. 

15 23. The polypeptide of claim 22, wherein said fusion protein includes, as a second 
polypeptide sequence, a polypeptide which functions as a detectable label for 
detecting the presence of said fusion protein or as a matrix-binding domain for 
immobilizing said fusion protein. 

24. The polypeptide of claim 13, wherein said polypeptide is a fusion protein further 
20 comprising, in addition to said HDx sequence, a second polypeptide sequence 

having an amino acid sequence unrelated to an HDx polypeptide sequence. 

25. A purified or recombinant HDx polypeptide encoded by a nucleic acid which 
hybridizes under stringent conditions to a nucleotide sequence designated m SEQ 
IDNo. 1. 

25 26. A purified or recombinant HDx polypeptide compri^ng a v motif represented in the 
8«n^ formula 
DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKYH- 
X5RVLYIDIDIHHGDGX5EAFYX7TDRVMTVSF and a X motif represented in 
CVEXi\^2lT^3P-X4LX5LGGGGYTX6RNVARCWTYET. 

30 27. An isolated nucleic acid which encodes a deacetylase activity and hybridizes under 
stringent conditions to a nucleotide sequence designated in SEQ ID No. 1. 
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28. An isolated nucleic acid encoding an HDx polypeptide, which polypeptide 
specifically modulates histone acetylation. 

29. The nucleic acid of claim 28, which HDx polypeptide comprises a v motif 
represented in the general formula 

5 DIAXiNWAGGLHHAKKX2EASGFCY\TSIDIVX3X^ 

KYHX5RVLYIDIDIHHGDGX5EAFYX7TDRVMTVSF and a X motif 
represented in CVEJCi Vi6C2ITnIX3P-X4LX5LGGG^ 

30. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide 
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof 

10 31. The nucleic acid of claim 28. which HDx polypeptide comprises a polypeptide 
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof 

32. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide 
sequence designated in SEQ ID No: 2. 

33. The nucleic acid of claim 28, which HDx polypeptide has a molecular weight in the 
15 range of 45-70 Kd. 

34. The nucleic acid of claim 28, which HDx polypeptide is a fusion protein further 
comprismg, in addition to HDx polypeptide sequences, a second polypeptide 
sequence having an amino acid sequence unrelated to a nucleic acid sequence. 

35. The nucleic acid of claim 34, wherein said fusion protein includes, as a second 
20 polypeptide sequence, a polypeptide which functions as a detectable label for 

detecting the presence of said fusion protein or as a matrix-binding domain for 
immobilizing said fusion protein, 

36. The nucleic acid of claim 28, further comprising a transcriptional regulatory 
sequence operably linked to said nucleotide sequence so as to render said nucleic 

25 acid suitable for use as an expression vector. 

37. An expression vector, capable of replicating in at least one of a prokaryotic cell and 
eukaryotic cell, comprising the nucleic acid of claim 36. 

38. A host cell transfected with the expression vector of claim 37 and expressing said 
recombinant polypeptide. 

30 39. A method of producing a recombinant HDx polypeptide comprising culturing the 
cell of claim 38 in a cell culture medium to express said recombinant polypeptide 
and isolating said recombinant polypeptide from said cell culture. 
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40. A transgenic non-human animal having cells which harbor a heterologous transgene 
encoding an HDx polypeptide. 

41. A transgenic non-human animal having cells in which an HDx gene is disrupted. 

42. A recombinant transfection system, comprising 

5 (i) a gene construct including the nucleic acid of claim 28 and operably linked to a 

transcriptional regulatory sequence for causing expression of said HDx 
polypeptide in eukaryotic cells, and 

fii) a gene delivery composition for delivering said gene construct to a cell and 
causing the cell to be transfected with said gene construct. 

10 43. The recombinant transfection system of claim 42, wherein the gene delivery 
composition is seleaed from a group consisting of a recombinant viral particle, a 
liposome, and a poly-cationic nucleic acid binding agent. 

44. A nucleic acid composition comprising a substantially purified oligonucleotide, said 
oligonucleotide including a region of nucleotide sequence which hybridizes under 

15 stringent conditions to at least 25 consecutive nucleotides of sense or antisense 

sequence of an HDx gene. 

45. The nucleic acid composition of claim 44, which oligonucleotide hybridizes under 
stringent conditions to at least 50 consecutive nucleotides of sense or antisense 
sequenc of an HDx gene. 

20 46. The nucleic acid composition of claim 44, wherein said oligonucleotide further 
comprises a label group attached thereto and able to be detected. 

47. The nucleic acid composition of claim 44, wherein said oligonucleotide has at least 
one non-hydrolyzable bond between two adjacent nucleotide subunits. 

48. A test kit for detecting cells which contain an /iZ>x-encoding nucleic acid, 
25 comprising the nucleic acid composition of claim 44 for measuring, in a sample of 

cells, a level of nucleic acid encoding an HDx protein. 

49. A method for modulating one or more of growth, differentiation, or survival of a 
mammalian cell responsive to /«>x-mediated histone deacetylation, comprising 
treating the cell with an effective amount of an agent which modulates the 

30 deacet>'lase activity of an HDx polypeptide thereby altering, relative to the cell in 

the absence of the agent, at least one of (i) rate of growth, (ii) diflFerentiation, or 
(iii) survival of the cell. 
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50. An antibody to an HDx polypeptide. 

51. The antibody of claim 50, wherein said antibody is monoclonal. 

52. A diagnostic assay for identifying a cell or cells at risk for a disorder characterized 
by unwanted cell proliferation or differentiation, comprising detecting, in a cell 

5 sample, the presence or absence of a genetic lesion characterized by at least one of 

(i) aberrant modification or mutation of a gene encoding an HDx protein, and fii) 
mis-expression of said gene; wherein a wild-type form of said gene encodes m HDx 
protein characterized by an ability to modulate the signal transduction activity of a 
TGFP receptor. 

10 53. The assay of claim 52, wherein detecting said lesion includes: 

i. providing a diagonistic probe comprising a nucleic acid including a region of 
nucleotide sequence which hybridizes to a sense or antisense sequence of said 
gene, or naturally occuring mutants thereof, or 5' or 3* flanking sequences 
naturally associated with said gene; 

15 ii. combining said probe with nucleic acid of said cell sample; and 

iii. detecting, by hybridization of said probe to said cellular nucleic acid, the 
existence of at least one of a deletion of one or more nucleotides firom said 
gene, an addition of one or more nucleotides to said gene, a substitution of 
one or more nucleotides of said gene, a gross chromosomal rearrangement of 
20 all or a portion of said gene, a gross alteration in the level of an mRNA 

transcript of said gene, or a non-wild type splicing pattern of an mRNA 
transcript of said gene. 

54. The assay of claim 53, wherein hybridization of said probe further comprises 
subjecting the probe and cellular nucleic acid to a polymerase chiin reaction (PCR) 

25 and detecting abnormalities in an amplified product. 

55. The assay of claim 53, wherein hybridization of said probe further comprises 
subjecting the probe and cellular nucleic acid to a ligation chain reaction (LCR) and 
detecting abnormalities in an amplified product. 

56. The assay of cleiim 53, wherein said probe hybridizes under stringent conditions to 
30 the nucleic acid designated by SEQ ID No, 1 . 

57. An assay for screening test compounds to identify agents which inhibit the 
deacetylation of histones comprising: 



wo 97/35990 



PCT/US97/05275 



-129- 

i. providing a reaction mi^cture including a histonc deacetylase activity of an 
HDxAike polypeptide, a substrate for a histone deacetylase, and a test 
compound; and 

ii. detecting the conversion of the substrate to product, 

5 wherein a statistically signficant decrease in the conversion of the substrate in the 

presence of the test compound is indicative of a potential inhibitor of histone 
deacetylation. 

58. The assay of claim 57, wherein the HDx-Iike polypeptide is of mammalian origin. 

59. The assay of claim 57, wherein the HDx4ike polypeptide is an RPD3-Iike 
10 deacetylase of fiingal origin. 

60. The assay of claim 57, wherein the reaction mixture is a reconstituted protein 
mixture. 

61. The assay of claim 57, wherein said reaction mixture is a cell lysate. 

62. The assay of claim 57, wherein the HDx-like polypeptide is a recombinant proteb. 

15 63. An assay for screening test compounds to identify agents which inhibit histone 
deacetylase interaction with cellular proteins, comprising: 

i. providing a reaction mixture including an HDx-hke protein, an HDx- 
binding protein, and a test compound; and 

ii. detecting the interaction of the HDxAike protein and the HDx binding 
20 protein, 

wherein a statistically signficant decrease in the interaction of the proteins in the 
presence of the test compound is indicative of a potential inhibitor of a histone 
deacetylase. 

64. The assay of claim 63, wherein the HDx-like protein is of mammalian origin. 

25 65. The assay of claim 63, wherein the HDx-like polypeptide is an RPD3-like 
deacetylase of fungal origin. 

66. The assay of claim 63, wherein the HDx-like protein is a histone, or a portion 
thereof which interacts with an HDx-liko polypeptide. 

67. The assay of claim 63, wherein the HDx-like protein is an PbAp48 protein, or a 
30 portion thereof which interacts with an //Dx-like polypeptide. 

68. The assay of claim 63, wherein the reaction mixture is a reconstituted protein 
mixture. 
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69. 
70. 
71. 

5 72. 
73. 

10 

74. 



The assay of claim 63, wherein said reaction mixture is a cell lysate. 

The assay of claim 63, wherein the HDx4ike polypeptide is a recombinant protein. 

The assay of claim 63, wherein one or both of the /ffibtr-like protein and HDx- 
binding protein is a fusion protein. 

The assay of claim 63, wherein at least one of the /ttbf-like protein and HDx- 
binding protein comprises an endogenous detectable label for detecting the 
fonnation of said complex. 

The method of claim 63, which reaction mixture is a whole cell, and interaction of 
the /tt)x-like protein and 7/Dr-binding protein is detected in a two hybrid assay 
system . 

A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and 
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides; 

B is selected from the group consisting of substituted and unsubstituted 
C4-Cg alkylidenes, C^-C^ alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in 
which D and F are, independently, absent or represent a C2-C7 alkylidene, a C2- 
C7 alkenylidene or a C2-C7 alkynylidene, and E represents O, S, or NR*, in which 
R' represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, aryl, or 
a heterocyclyl; and 

C is selected from the group consisting of 



acid; in which Z represents O, S, or NR5, and Y; R5 represents a hydrogen, an 
alkyl, an alkoxycarbonyl, an aryloxycarbonyl, an alkylsulfonyl, an aiylsulfonyl or 
an aryl; R g represents hydrogen, an alkyl, an alkenyl, an alkynyl or an aryl; and R7 
represents a hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a 
hydroxylamino, an alkoxylamino or a halogen; with the proviso that the 
compound is not trapoxin. 




H 

.R7 /N. 




, and a boronic 
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5 76, 
10 

77. 

15 
20 



A pharmaceutical preparation comprising (i) the composition of claim 74 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically 
acceptable diluent. 

A method for modulating one or more of growth, diflFerentiation, or survival of a 
mammalian cell responsive to -HDr-mediated histone deacetylation, comprising 
treating the cell with an effective amount of the cpmpisition of claim 74 so as to 
modulate the deacetylase activity and alter, relative to the cell in the absence of the 
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the 
rate of survival of the cell. 

A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and 
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides; 

B is selected from the group consisting of substituted and unsubstituted 
C4-C8 alkylidenes, C4-Cg alkenylidenes, C4-Cg alkynylidenes, and -(D-E-F)-, in 
which D and F are, independently, absent or represent C2-C7 alkylidenes, C2-C7 
alkenylidenes or C2-C7 alkynylidenes, and E represents O, S, or NR*, in which R' 
represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, an aryl, 
or a heterocyclyl; and 

C is selected from the group consisting of 




, , , in which R9 represents a hydrogen, 

an alkyl, an aryl, a hydroxyl, an alkoxy, an aryloxy or an amino, 

with the proviso that the inhibitor compound is not trichostatin. 



78. 

30 



A pharmaceutical preparation comprising (i) the composition of claim 77 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically 
acceptable diluent. 
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79. A method for modulating one or more of growth, differentiation, or survival of a 
mammalian cell responsive to //Z>x-mediated histone deacetylation, comprising 
treating the cell with an effective amount of the compisition of claim 77 so as to 
modulate the deacetylase activity and alter, relative to the cell in the absence of the 
5 agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the 

rate of survival of the cell. 



80. A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

10 A is selected firom the group consisting of cycloalkyls, unsubstituted and 

substituted aryls, heterocyclyls, amino acyls, and cyclopeptides; 

B is selected from the group consisting of substituted and unsubstituted 
C4-Cg alkylidenes, C4-Cg alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in 
which D and F are, independently, absent or a C2-C7 aikylidene, a C2-C7 
15 alkenylidene, or a C2-C7 alkynylidene, and E represents O, S, or NR', in which R* 

is H, lower alkyl, lower alkenyl, lower alkynyl, aralkyl, aryl, or heterocyclyl; and 

Y 




C represents ; in which Y is O or S, and R7 represents a 

hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an 
alkoxylamino or a halogen. 

20 

81. A pharmaceutical preparation comprising (i) the composition of claim 80 in an 
amount eflfective for inhibiting proliferation of a cell, and (ii) a pharmaceutically 
acceptable diluent. 

25 82. A method for modulating one or more of growth, differentiation, or survival of a 
mammalian cell responsive to //Dx-mediated histone deacetylation, comprising 
treating the cell with an effective amount of the compisition of claim 80 so as to 
modulate the deacetylase activity and alter, relative to the cell in the absence of the 
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the 

30 rate of survival of the cell. 
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Figure I A 
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Figure IB 
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Figure 2A 
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Figure 2B 
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Figure 3B 
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Figure 4A 
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Figure 4B 
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Figure 4C 
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Figure 4D 
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Sequences 

1> HDl 2> HSCX1A021 3> R21136 

5> R1B769 6> D31480 7> R98879 



4> F07807 
8> N59055 



50 

1> ATGGCGCAGACgCAGGGCACCCGGAGGAAAGTCTGTTACTACTACGACGGGGATGTTGGA 
6 > ATGGCGCAGACGCAGGGCACCCGGAGGAAAGTNTGTTACTACTACGACGGGGATGTTGGA 

100 

1> AATTACTATTATGGACAAGGCCACCCAATGAAGCCTCACCGAATCCGCATGACTCATAAT 
6 > AATTACTATTATGGACAAGGCCACCCAATGAAGCCTCACCGAATCCGCATGACTCA^ 

150 

1> TTGCTGCTCAACTATGGTCTCTACCGAAAAATGGAAATCTATCGCCCTCACAAAGCCAAT 

6 > TTGCTGCTCAACTATGGTCTCTACCGAAAAATGGAAATCTATCGNCCTCACAAAGCCAAT 

200 

1> gCTGAGGAGATGACCAAGTACCACAGCGATGACTACATTAAATTCTTGCGCTCCATCCGT 
^.r^^^^ ATTGACTTCCTGCAGAGAGTCAGC 
6> NCTGAGGAGATGACCAAGTANCACAGCGATGAC 

^ TCCTGCAGAGAGTCAGC 

250 . . 

1> CCAGATAACATGTCGGAGTACAGCAAGCAGATGCAGAGATTCAACGTTGGTGAGGACTGT 
2> CCCACCAATATGCAAGGCTTCACCAAGAGTCTTAATGCCTTCAACGTAGGCGATGA^^ 

CGATGACTGC 

7 > CCCACCAATATGCAAGGCTTCACCAAGAGTCTTAATGCCTTCAACGTAGGCGATGACTGC 



1 > CCAGTATTCGATGGCCTGTTTGAGTTCTGTCAGTTGTCTACTGG 



359 

- TGGTTCTGTGGCAAGT 

2 > CCAGTGTTTCCCGGGCTCTTTGAGTTCTGCTCGCGTTACACAGGCGCATCTCTGCAAGGA 
5 > CCAGTGTTTCCCGGGCTCTTTGAGTTCTGCrCGCGTTACACAGGCGCATCTCTGC^^ 
7> CCAGTGTTTCCCGGGCTCTTTGAGTTCTGCTCGCGTTACACAGGCGCATCTCTGC^^ 



400 

1 > GCTGTGAAACTTAATAAGCAGCAGACGGACATCGCTGTGAATTGGGi 



2 > GCAACCCAGCTGAACAAi^GATCrGTGATATTGCCATTAACTGGGCTGGTGGTCTGC^^ 
5 > GCAACCCAGCTGAACAACAAGATCTGTGATATTGCCATTAACTTGGCTGGTGGCTTNAAC 
7 > GCAACCCAGCTGAACAACAAGATCTGTGATATTGCCATTAACTGG 



CTGGGGGQCTGCAC 
CTGGTGGTCTGCAC 
CTGGTGGCTTNAAC 
GCTGGTNGTCTGCAC 



450 

1 > CATGCAAAGAAGTCCGA6GCATCTGGCTTCTGTTACGTCAATGATATCGTCTTGGCCATC 

2 > CATGCCTAGAAGTTTGAGGCCTCTGGCTTCTGCTATGTCAACGACATTGTGTTTGGC^^ 
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Tigure 5A (cont) 



5 > NATGCCANGANGTTTNAGGCCTCTGGNTTCTGCTATGTCAACGACATTGTGATTGGCATC 
7 > CATGCCAAGAAGTTTGAGGCCTCTGGTTTCTGCTATGTCAACGACATTGTGATTGGCATC 

500 . . 

1 > CTGGAACTGCTAAAGTATCACCAGAGGG tgCTGTACATTGACATTGATATTCACCATGGT 

2 > CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCATGGT 
5 > CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCATGGT 
7 > CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCA 



• • • • .600 

1> GACGGCGTGGAAGAGGcCTTCTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG 
2> GACGGGGTTCAAGAAGCTTTCTACCTCACTGACC 

5 > G ACGGGGTTCAAGAAGCTTTCTACCTCACTGACCGGGTCATG ACGGTGTCCTTTCCACAA 
^ ^ > CTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG 

650 

1 > TATGGAGAGTACTTCCCAGGAACTGGGGACCTACGGG ATATCGGGGCTGGCAAAGGCAAG 
5 > ATACGGGAAATTTACTTNTTCCNGGGGCACAGGTGACATGTTNTGGAAGTTCGGGGGGCA 
1 0 > TATGGAGAGTACTTCCCAGGGACTTGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG 



700 

1 > TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTGATGACGAGTCCTATGAGGCCATT 
3 > TACTACTGTCTGAACGTGCCCCTGCGGATGGGCATTGATGACCAGAGTTACAAGCACCTT 
5> GGAGAGTTGGCCC 

10 > TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTNATGACGAGTCCTATGAGGCCATT 

750 

1 > TTCAAGCCGGTCATGTCCAAAGTAATGGAGATGTTCCAGCCTAGTGCGGTGGTCTTACAG 
3 > TTCCAGCCGGTTATCAACCAGGTAGTGGACTTCTACCAACCCACGTGCATTGTGCTCCAG 
® ^ CCCTATAGTGAGTCGTATTNN 
1 0 > TTCAAGCCGGTCATGTCCAAAGTAATNGAGATGTTCCAGCCTAGTGCG 

800 

1> TGTGGCTCAGACTCCCTATCTGGGGATCGGTTAGGTTGCTTCAATCTAACTATCAAAGGA 
3 > TGTGGAGCTGACTCTCTGGGCTGTGATCGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 

8 > TNAAAACATGACTCACTNGGNTNNNTACGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 
9> 

AGGT 

' ^^0 , . . ,900 

1> CACgCCAAGTGTGTGQAATTTGTCAAGAGCTTTAACCTGCCTATGCTGATGCTGGGAGGC 

3 > CATGGGGAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 

4 > GGA 
e > CATGGGNAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 

9 > NATGCrAAATGTGTAGAAGTTGTAAAAACTTTTAACTTACCATTACTGATGCT^ 
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Figure 5 A (cont) 

950 

1 > GGTGGTTACACCATTCGTAACGTTGCCCGGTGCTGGACATATGAGACAGCTGTGGCCCTG 

3 > GGTGGTTATACTGTCCGAAATGTTGCCCGCTGCTGGACATATGAGACATCGCTGCTGGTA 

4 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

8 > GGTGGTTATACTGTCCGAAATGTNGCCCGCTGCTGGACATATGAGACANCGCTGCTGGTA 

9 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

1000 

1 > GATACGGAGATCCCTAATGAGCTTCCATACaATGACTACTTTGAATACTTTGGACCAGAT 

3 > GAAGAGGCCATTAGTGAGGAGCTTCCCTATAGTGAATACTTCGAGTACTTTGCCCCAGAC 

4 > GATTGTGAGATTCCCAATGAGTTGCCATATAATGATTACTTTGAGTATTTTGGACCAGAC 

8 > GAAGAGGCCATTAGTGAGGAGCTTCCCTAATAGTGAATACTTCGOTACTTTGCCCCAGAC 

9 > GATTGTGAGATTCCCAATGGTAAGTGTTCTCATTACAATATCTTTATTGTATG 

1050 

1 > TTCAAGCTCCACATCAGTCCTTCCAATATGACTAACCAGAACACGAATGAGTACC tGGAG 
3> TTCACACT 

4 > TTCAAACTGCATATTAGTCCTTCAAACATGACAAACCAGAACAC 

8 > TTCAC7VCTTCATCCANATGTCAGCACCCGCATCGAGAATCCAGAACTCACGCCAGTATC 

1100 

1 > AAGATCAAACAGCGACTGTTTGAGAACCTTAGAATGCTGCCGCACGCACCTGGGGTCCAA 
8 > NGGACCAAGATCCGCCAGACAATCTTTGNAAACCTGAAGGTTCTTNAACC 

1150 .... 1200 

1 > ATGCAGGCGATTCCTGAGGACGCCATCCCTGAGGAGAGTGGCGATGAGGACGAAGACGAC 

1250 

1 > CCTGACAAGCGCATCTCGATCTGCTCCTCTGACAAACGAATTGCCTGTGAGGAAGAGTTC 

1300 

1 > TCCGATTCTGAAGAGGAGGG AGAGGGGGGCCGCAAGAACTCTTCCAACTTCAAAAAAGCC 

1350 

1 > AAGAGAGTCAAAACAGAGGATGAAAAAGAGAAAGACCCAGAGGAGAAGAAAGAAGTCACC 

1400 

1> GAAGAGGAGAAAACCAAGGAGGAGAAGCCAGAAGCCAAAGGGGTCAAGGAGGAGGTCAAG 
1> TTGGCCTGA 



9> F06693 
10> H05234 
11> R21136 
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Figure 5B 



(1) ---n»qtqGTRRKVCYYYIX3DVGKYYYGQGHPMKPHRIW1THNLLLNYGLyRK 

t i) (nvyeatpfdpITVKPSDKRRVAYFYDADVGNYAYGAGHPMKPHRIRMAHSLIMNYGLYKK 
3C_rpd3 ( 1) MALTIXSTKKKVCYYYDGDVGlTfyYGOGHPMKPHRIIWTHNLLUT^G^^ 

IDFLQRVSPTNMQGFTKSLNAFNVGDDCPVFPGLFEFC 
(51) MEIYRPHKAKAEE^r^KVHSDDYIKFLRSIRPO^IMSEYSK0M0RFNVGEDCPVFDGLFEFC 
^P^^ ( 61) MEIYRAKPATKQEMCOFHTDEYIDFLSRVTPDMLEMFKRESVKFNVGDDCPVFDGLYEYC 

X_rpd3 { 51) MEIFRPHKASAEDMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFC 

<^^^^> SRYTGASLQGATOLNNKICDXAINWAGGLHHAKKFEASGFCYVNDIVFGILELLKYHPRV 

(111) OLSTGGSVASAVKUiKQQTDIAWWAGGLHKAKKSEASGFCYVNDIVLAILELLKYHQRV 

^^^^ t 121) SISGGGSMEGAARLNRGKCDVAVNYAGGLHHAKKSEASGFCYLNDIVLGIIEIXRYHPRV 

X_rpd3 ( 111) QLSAGGSVASAVKLNKQOTDISVNWSGGLHHAKKSEASGFCYVNDIVLAILELLKYHORV 

YYCLNVPLRW 

<£stA> LYIDIDIHHGDGVOEAFYLTDRVMTVSFPQIREIY 

( 171) LYIDIDIHHGDGVEEAFYTTDRVHTVSFHKYCEYFPGTGDLRDICAGKGKYYAWNYPLRD 
RPD3 ( 181) LYIDIDVHHGDGVEEAFYTTDRVrrrCSFHKYGEFFPGTGELRDIGVGAGKNYAVNVPLRD 

x_rpd3 t 171> VYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGXYYAVNYALRD 

"^^^^^^ NLLVU3HAKCVEWKT 

*^tB> GIDDQSYKHLFOPVINQWDFYQPTCIVLQCGADSLGCDRLGCFNLSIRGHGECVEYVKS 

( 231) GIDDESYEAIFKPVMSKVMEMFQPSAWLQCGSDSLSGDRLGCFNLTIKGHAKCVEFVKS 

( 241) GIDDATYRSVFEPVIKKIMEWYQPSAVVWJCGGDSLSGDRLGCFNLSKEGHANCVNYVKS 

X_rpd3 ( 231) GIDDESYEAIFKPVMSKVMEMFOPSAWLOCGADSLSGDRLGCFNLTIKGKAKCVEFIKT 

*3/4> n^LPLl>IUX;GGYTIRhWARC>rrYETAVALDCEIPNELPYm>yFEYFGPDFKLHISPSNM 
< 3 > FNI PLLVLGGGG YTVRNVARCHTYETSLLVEEAISEELPYSEYFEYFAPDFTLHP 

( 291) FNLPMU1UMGGYTIRNVARCWTYETAVALDTEIPNELPYNDYFRYFGPDFKLHISPSNM 

t 301) FGIPMMWGGGGYTMRNVARTTrcFETGLLNNWU3ia>LPYNEYVEYY 

x_rpd3 ( 291) FNLPLLMLGGGGYTIRNVARCVTTYETAVALDSEIPNELPYNDYFEYFGPDFKLHISPSNM 

<EstC> TNQN 

<EstB> ( 351) TNONTKEYLEKIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESCDEDEDDPDKRISICSS 

t 3S1) FNVOTPEYU3KVMTNlFANLENTKYAPSVQLNHTPRDaedlgdveedsaeakdtkggsqy 

X_rpd3 ( 3S1) TNONTNEYLEKIKORLFENLRMLPHAPGVQMQAVAEDSIHDDSGEEDEDDPDKRISIRSS 

(411) DKRIACEEEFSDSEEEGEGGRKNSSNFKKAKRVKTEDEREJcdPEEKKEVTEEEKTKEEKP 
RPD3 ( 421) ardlhvehdnef y - _ 

X_rpd3 ( 411) DKRIACDEEFSDSEDEGEGGRKNVANrKKVKRVKTEEEKE--GEDKKDVKEEEKAKDEKT 

HDl ( 4 71) EAKGVKEEVRla 

RPD3 ( 4 34) 

X_rpd3 ( 46 9) DSKRVKEETKsv 



wo 97/35990 

15/26 



Figure 6 

Specificity Element A Linking Element B 
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Figure 7 
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Figure 8A 
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Figure 8B 




1 HX:.Z^MM,HOfiT 

2 LiOaTHf.Mii>H,H^ 

3 H^JO*IMCTHF 
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Figure 8C 
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Figure 9A 
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Figure 9B 
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Figure 9C 




K-trap affinity matrix 
30 
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Figure 10 
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Figure 11 





